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Foreword 


The staff of the Department of Electrical Engineering at the Massa- 
chusetts Institute of Technology some twenty years ago undertook an 
extensive program of revision as a unit of its entire presentation of the 
basic technological principles of electrical engineering. By early 1943 
this collaborative enterprise had resulted in the publication of three 
volumes of a projected series. Publication of this complementary book 
of the series was delayed by the war. 

The decision to undertake so comprehensive a plan rather than to add 
here and patch there came from the belief that the Department’s large 
staff, with its varied interests in teaching and related research, could 
effect a new synthesis of educational material in the field of electrical 
engineering and evolve a set of textbooks with a breadth of view not 
easily approached by an author working individually. 

Such a comprehensive series, it was felt, should be free from the dupli- 
cations, repetitions, and imbalances so often present in unintegrated 
series of textbooks. It should possess a unity and a breadth arising from 
the organization of a subject as a whole. It should be useful to the student 
of ordinary preparation and also provide a depth and rigor challenging 
to the exceptional student and acceptable to the advanced scholar. It 
should comprise a basic course adequate for all students of electrical 
engineering regardless of their ultimate specialty. Restricted to material 
which is of fundamental importance to all branches of electrical engineer- 
ing, the course should lead naturally into any one branch. 

Such a basic synthesis, it is felt, has been appropriately achieved in the 
first three volumes. In the course of their generation it became more and 
more evident that the development of further extensions of this basic 
material covering specialized fields would correspondingly become more 
and more the responsibility of individual authorities who could relate 
their work to the basic structure. 

The four volumes and the organized program of teaching out of which 
they have grown, are thus the products of a major research project 
to improve methods of technological education. The experience gained 
through the years in this stimulating exploration, together with the 
rich background of accelerated development contributed by the circum- 
stances of war, opens the way for further evolution in this undertaking. 
Perhaps the most interesting potentialities lie in the influence of the social 
sciences and humanities on the extensions of this vital adventure. 


J. R. Killian, Jr, 




Preface 


When the writing of basic text material for the undergraduate curricu- 
lum was first undertaken by the Staff of the Department of Electrical 
Engineering we felt that this material should not merely fill the immediate 
needs of the undergraduate subjects for which it was primarily intended 
but in addition should offer stimulation and incentive for further study. 
For this reason occasional glimpses were afforded of the horizons which 
lie l^eyond the artificial boundaries set by conventional limitations in the 
treatment of technological principles. The collateral discussions engen- 
dered by these aims tended to appear digressive; moreover, they became 
so numerous as to interfere seriously with conciseness, continuity, and 
clarity. We therefore determined to relegate these supplemental treat- 
ments to an appendix. As the project developed the appendix became 
too large to justify itself; in fact, it took on the aspect of a text in itself. 
Thus evolved the concept of companion or reference volumes for collateral 
study. 

Later, when we evaluated this concept as to scope and organization, 
we concluded that the supplemental material would be more useful if the 
discussions of a purely mathematical nature were collated and separated 
from the applications to be predicated thereon. This book is the result 
of this effort to avoid redundance and to attain a logical unification. The 
applications themselves, then, were to appear as further reference volumes 
of the series depending for their foundations on The Mathematics of 
Circuit A nalysis. 

In the course of these efforts war intervened. Through its exigencies 
there was an unprecedented accelerat .-on of scientific and technological 
effort, compressing into a few years Avhat normally would have taken 
decades to achieve. Necessarily the revision program was held in abeyance 
during this critical period with the result that the Department of 
Electrical Engineering is now reconsidering the course revision program 
and modifying its prewar plans. The basic series must be reappraised in 
the light of the vast wartime developments. This may well lead to the 
addition of material to the earlier basic texts. Moreover, there must be 
drastic reconsiderations of the supplemental volumes which will form 
extensions of the basic series and complement this particular reference 
volume. Since at the moment this book goes to press the Department is 
in the midst of reevaluating the extensive revision program, it is impos- 
sible here to make precise pronouncements either as to the character or 
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number of volumes to follow. Although thus far it has been the practice 
to publish the volumes of the series without naming collaborators, since 
the immediate volume came about largely through the inspiration of 
Professor Guillemin, and since it was written entirely by him under the 
trying pressures of war, it has been decided to recognize him as author. 

As to the subject of this volume it may be well to emphasize that a 
mathematical textbook written by engineers is to be looked upon as an 
idea conveyor without claim to rigor. The discussions given herein should 
be regarded as being plausibility arguments rather than proofs. The 
primary purpose is to stimulate interest and lay a background of general 
understanding upon which the student may later build more carefully. 
In no wise therefore is the character of presentation here given to be 
looked upon as a substitute for the more formal rigorous treatment of 
the mathematician. 

Although other books of this nature have been written, in the planning 
of this book we felt that a rather complete assemblage of mathematical 
topics, needed specifically or collaterally in the analysis and synthesis of 
electrical networks and in the attack on field problems related to trans- 
mission lines, wave guides, and antennas, was still outstanding. The 
opportunity to include more detailed discussions of certain topics here- 
tofore given too little attention has been capitalized, we hojje, to the 
advantage of student and researcher alike. 

In the field of advanced algebra, for example, we believed that a 
discussion of determinants and matrices l>ccomes ever so much more 
meaningful when coupled with the geometrical interpretations provided 
by the subject of linear coordinate transformations and the closely 
related discussion of quadratic forms. Thus the first four chapters of 
this book bring together a collection of topics in advanced algebra that 
form a closely interrelated mathematical unit, and an indispensable unit 
in the foundations of circuit theory or any other field of application dealing 
with vibrations or particle dynamics. 

The fifth chapter, on vector analysis, is incorporated at this point 
since the geometrical and algebraic ideas involved are closely related to 
the foregoing material, and because the two-dimensional aspects of 
vector analysis and field theory are very helpful in lending physical 
clarity to numerous topics in complex function theory which is taken up 
next. Here there is considerably more detail than is to be found in exist- 
ing textbooks for engineers on this subject. While the usual book for 
the most part is content to lay a nominal background for the methods of 
complex integration plus some notions about conformal transformations 
for field-mapping purposes, here it is our aim to provide a physical and 
geometrical feeling for the properties of complex functions adequate to 
meet diverse needs in the network synthesis field. The last two articles 
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in this chajiter, for example, are specifically concerned with a detailed 
discussion of factors relevant to stability considerations and the properties 
of physical impedance functions. 

The final chapter, dealing with Fourier series and integrals, stresses 
a number of items with which the engineer is especially concerned such 
as the convergence of Fourier series, the approximation properties of its 
partial sums, singularity functions and their properties, elementary 
transform properties, evaluation of inverse transforms through complex 
integration, and their approximate evaluation through use of the saddle- 
point method. 

The Department of Electrical Engineering 

Cambridge, Massachusetts 

January, 1949 
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CHAPTER I 


Determinants 

1. Definitions and useful properties 

A discussion of the theory of determinants may be approached in a 
variety of ways. For the reader who already has an acquaintance with 
this subject and can, therefore, dispense with introductory remarks, the 
following procedure* is particularly effective since it strikes directly at 
those ideas which make the determinant a useful tool. 

A determinant is commonly written in the form 


1 

^^11 

U12 * 

’ * ^In 


A = 

U2I 

U22 * 

• * ^ 2 n 

[ 1 ] 


Onl 

an 2 * 

■ ■ flnn 



in which the vertical lines enclosing the array of elements a,* are intended 
to take the place of parentheses as an indication that these elements are 
the variables of the function Ay just as /(x) is written as a symbol for a 
function of x. 

The determinant is said to be of the «th order when it involves n rows 
and n columns, the total number of elements then being n^. The italic 
capital letter A is used as an abbreviation for the function whose elements 
are denoted by the lower case letter a. Thus, B may represent another 
determinant with the elements 6 ,jt, etc. The first index on an element 
indicates the row, the second index the column in which that element is 
situated. 

The determinant may be defined uniquely in terms of the following 
three fundamental properties: 

I. The value of the function is unchanged if the elements of any row. 
(column) are replaced by the sums of the elements of that raw 
(column) and the corresponding ones of another row (column); 
for examplcy if an, ai2, • • • ain are replaced by (an + aai), 
(ai2 + a32), • • * (ain + aan). 

II. The value of the function is multiplied by the constant k if all the 
elements of any raw or column are multiplied by k. 

III. The value of the function is unity if all the elements an the principal 
diagonal, that is, an, a22, * * * ann, others are zero. 

•C. Carath^odory, Vorlesungen uber redU Funktionen (Leipzig, 1918), Ch. VI. 
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To these three fundamental properties may be added the following 
derived ones: 

IV. The first fundamental property may be amplified to the effect that 
an arbitrary factor times the elements of any row or column may be 
added to (or subtracted from) the corresponding elements of another 
row or column. 

V. The algebraic sign of the function is rrcersed when any two roivs ir 
columns are interchanged. 

VI. The value of the function is zero if all the elements of a row or column 
are zero, or if the corresponding elements of any two row s or columns 
arc identical or have a common ratio. 


Rule IV may be seen to follow from I and II. As shown in the numerical 
example below, the elements of the third column are first multiplied by k\ 
the resulting ^-multiplied elements arc then added to the resiK'ctive ones 
of the first column, after which column three is mulliplicHl by thus 
restoring to its elements their original values. 


1 3 2 


1 3 2k 


(1 + 2k) 3 2k 

4 2 6 

kA = 

4 2 6k 


(4 6k ) 2 6k 

3 1 7 


3 1 Ik 

1 

(3 -j- 7k) 1 7k 


A = 


(1 + 2k) 3 2 
(4 + 6k) 2 6 
(3 + lk) \ 1 


[ 2 ] 


Rule V is a consequence of I and the extended form IV of II. Thus, 
suppose column 1 is first added to column 3, next the resultant column 3 
is subtracted from column 1, and, finally, this resulting first column is 
added to the resultant column 3. The net effect is to interchange columns 1 
and 3 and prefix all the elements of the first colunrm with minus signs, 
as illustrated below: 



1 3 2 


1 3 (2 + 1) 

A = 

4 2 6 

= 

4 2 (6 + 4) 


3 1 7 


3 1 (7 + 3) 



-2 3 (2 + 1) 


-2 3 1 

= 

-6 2 (6 + 4) 

= 

-6 2 4 


-7 1 (7 + 3) 


-7 1 3 


The first part of rule VI follows from the proi:)erty II for k = 0, and the 
second part is seen to be true on account of IV because a row or column 
of zeros is obtained when, for a suitably chosen factor, the ^-multiplied 
elements of one of the proportional rows or columns are subtracted from 
the respective elements of the other row or column. 
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2. Evaluation of numerical determinants 

The properties discussed above may be applied to the numerical 
evaluation of determinants, as is best illustrated by the foDowing numeri- 
cal example. Let 

112 

^= 4 2 6 [4] 

3 1 7 

Sttp /. Subtract from the second row the 4-multiplied elements of the 
first row: 

1 3 2 

A = 0 -10 -2 [5] 

3 1 7 

Step 2. Subtract from the third row the 3-multiplied elements of the 
first row: 

1 3 2 

A= 0 -10 -2 [6] 

0 -8 1 

Step 3. Subtract from the third row the ^multiplied elements of the 
second row: 

1 3 2 

A = 0 -10 -2 [7] 

0 0 ^ 

Step 4. Subtract from the second column the 3-multiplied elements of 
the first column : 

1 0 2 

A = 0 -10 -2 [8] 

0 0 ^ 

Step S. Subtract from the third coliunn the 2-multiplied elements of 
the first column: 

1 0 0 

A = 0 -10 -2 [9] 

0 0 

Step 6. Subtract from the third column the ^multiplied elements of 
the second column : 

1 0 0 

= 0 -10 0 

0 0 Jj*- 


[ 10 ] 
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Application of the fundamental properties II and III then gives 
1 0 0 

A = (1)(-10)(Y) 0 10= (l)(-10)(iga) ^ -26 [11] 

0 0 1 

It is useful to note that the modifications involved in the last three 
steps of this process do not influence the values of the elements on the 
principal diagonal, the product of which is equal to the value of the 
determinant. This fact may be stated in the form of an additional derived 
property: 

V Vlti The value of the special determinant in triangular farm: 

y ' 

ail ( 2|2 • • • Oin 

0 022 ^23 * ’ ■ 

= 0 0 a33 • • ain [12] 


I 0 0 0 • • • tfnn I 

is given by the prodtut (ana 22 * * * ^nn) of the elements on its 
principal diagonal. 

With the help of this rule the value of the determinant in the above 
example may be set down after the completion of the third step, 
j " If in the determinant A the rows and columns are interchanged, the 
I values of the elements on the principal diagonal are not aflTected; and if 
I the above operations with respect to rows are then replaced by the same 
operations with respect to corresponding columns and vice versa, the 
same final value is evidently arrived at. This fact demonstrates the 
equivalence of rows and columns as far as the value of a determinant is 
concerned. For convenience in reference this is stated as the property: 

VIII. The value of a determinant is unchanged if its rows are written 
as corresponding columns or vice versa. 

In numerical work, the method of evaluation illustrated in the above 
example is short and convenient to apply. When an analytic result is 
desired, however, other methods are usually preferable. They are given in 
Arts. 4 and 5, to which the discussion immediately following serves as an 
introduction. 

3. Minors and cofactors 

If in the determinant A of Eq. 1, one or more rows and a corresponding 
number of columns are deleted, the remaining square array of elements is 
again a determinant. It is referred to as the (n — ^)-rowed minor (or 
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minor determinant) of A, where the integer p denotes the number of rows 
or the corresponding number of columns which have been deleted. Thus 
the «-rowed minor is the determinant itself. An (n — l)-rowed minor is 
also spoken of as a first minor, an (n — 2)-rowed one as a second minor, 
etc* 

The minor is customarily denoted by a symbol whose indexes refer to 
the canceled rows and columns. Thus the minor Mik is formed by cancel- 
ing the ith row and the ^th column in ^4 . It is quite conunon to speak of 
Mik as the minor of or as the minor corresponding to the element a, it, 
although (according to the immediately following discussion) it should 
more properly be referred to as the complement of a, a. 

A minor of second order, denoted by Ma, is formed by canceling the 

T$ 

xth and rth rows and the ith and 5th columns. The extension of this 
notation to the designation of minors of higher order is readily recognized, 
but when the number of canceled rows and columns is large (cases of this 
sort are infrequent in engineering applications), such notation becomes 
awkward and is usually replaced by some other expedient which seems 
more effective at the moment. 

The elements which lie at the intersections of the canceled rows and 
columns, arranged in a square array in the same order (from left to right 
and from top to bottom) as they appear in the original determinant, form 
another minor determinant N which is called the complement of M. The 
complement of a first minor is a single element; that of a second minor is 
a two-rowed determinant, etc. 

In particular, the minors formed by canceling the same rows as columns’ 
(these intersect on the principal diagonal) are called principal minors, and 
their complements are again principal minors. 

An alternative view may be taken with regard to the formation of 
minors. Instead of obtaining the minors by canceling rows and columns 
in the original determinant, they may be formed by first selecting from the 
determinant certain rows, and subsequently selecting from this rectangu- 
lar (nonsquare) array any like number of columns. Or, from a given set 
of columns in the determinant ^4, minors may be formed by the selection 
of corresponding numbers of rows. The minors thus formed are evidently 
the complements of those obtained by the process of canceling the same 
combinations of rows and columns. 

If M is a minor of the determinant A and N is its complement, and if 
the rows and columns contained in M are formed from the f, 7, • • • th 

rows apd the g, r, 5, • • • th columns of ^4, then 

jj X [13] 


•The terms **minor of first order,” “second order,” etc., are also used. 
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is referred to as the algebraic complement of M. This differs from the 
ordinary complement only in its algebraic sign. If the sum of the indexes 
referring to the (first, second, etc.) rows and columns of A contained in M 
is an even integer, the algebraic sign, by Eq. 13, is +1; if this sum is an 
odd integer, the sign is — 1 . 

The relation between a minor and its complement is evidently a mutual 
one in the sense that the two designations may be interchanged. Whereas 
M may be called a minor and N its complement, N may be looked upon 
as the minor and M as its complement. Thus, a single clement may be 
thought of as a one-rowed minor. If the element is a.jt, its complement is 
the minor 3/,*. 

The algebraic complement of the single element anc is sufficiently 
important to deser\'e a special name and symbol. It is called the cofactor 
of Oik and is quite commonly denoted by the corresponding upjKT-case 
letter mth like subscripts (although various other notations are also 
encountered in the literature). Thus, the cofactor of is given by 

Aa = [ 14 ] 

It differs from the minor (which is the complement of <!,>) only in algebraic 
sign; hence the cofactor is sometimes referred to as the sigfied minor. 

The indexes i and whose integer values determine the sign-controlling 
factor (—1)*“*"^, refer respectively to the row and column intersecting at 
the point where the element a^ is located. If the cofactor is formed for an 
element in the original determinant, its indexes and those appearing in 
the sign-controlling factor obviously agree with the indexes appearing 
on the element in question, because the indexes on an element of the 
original determinant indicate respectively its row and its column positions. 
This correspondence is, however, no longer consistently true in a minor 
of the original determinant. 


For example, the minor M 2 Z 

of yl, 

,Eq. 

1, reads 



dll 

ai2 

ai4 

a, 5 • • • 

^In 


^31 

^32 

<3^34 

<^5 * * * 

<^3n 

II 

CO 

^41 

^42 

044 

a45 • • • 

a 4 n 



<^n2 


^n6 * 

^nn 


Here the element a 32 , for instance, is located at the intersection of the 
second row and the second column. This is referred to as the (2,2) position 
in the minor determinant of Eq. 15. In general, the term (r,^) position is 
used to indicate the location at which the rth row and ^th column of a 
given rectangular array intersect. The object of the present argument is 
to point out as a typical case that in forming the cofactor for the element 
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fl 32 for the minor determinant of Eq. IS, the sign-controlling factor is 

(_ 2 ) 2+2 1 ) 3+2 

If only the algebraic signs of the cofactors are set down at the positions 
of the corresponding elements in a rectangular array, the following 
checkerboard of + and — signs is obtained : 


+ 

— 

+ 

_ . . 

- 

-i- 

- 

-1- 

+ 

- 

-1- 

_ . . 

— 

+ 

— 

-1- •• 


[16] 


This pictorial statement for the signs of the cofactors is sometimes referred 
to as the “checkerboard rule/’ 


4. Laplace's development 

In the manipulation of determinants, and sometimes to facilitate their 
numerical evaluation, a process of development formulated by Laplace 
is frequently useful. It may be stated in the following form: 

// all the minors arc formed from a selected set of rows or columns of a 
determinant and the products of these mimrs with their respective al- 
gebraic complements are added ^ the resulting sum is equal to the deter- 
minant. 


If a single row is selected, this development reads 


A = OiiAii -f U/2-4,2 + • * • 4- OinAin 

ii=i, 2, 

• • • ») 

[17] 

For a single column, the result is written 

A = Uu/lu + (i2k^^2k + • • * + dnkAnk 

(k = 1, 2, 

•••n) 

[18] 


In Eq. 17 the determinant is represented by the sum of the products of 
the elements of any row with their respective cofactors. In Eq. 18 a 
corresponding summation is carried out with respect to the elements and 
cofactors of any column. This simplest form for the Laplace development, 
which is also called an expansion of the determinant along one of its rows 
or columns, is the one most frequently used. 

^ It may be of interest, however, to iUustrate a more complicated example 
of this type of development. Let the following fourth-order determinant 


ail 

ai2 

<3^13 

014 

^21 

a22 

023 

024 



®33 

<*34 

a4i 

0^2 

<*43 

<*44 


[19] 


be developed through the selection of the first two columns for the forma- 
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tion of minors. All possible two-rowed minors are systematically formed 
as the rows: 1, 2; 1, 3; 1, 4; 2, 3; 2, 4; 3, 4 are selected from these columns. 
The sign-controlling factors of the corresponding algebraic complements 
are respectively: 

(^ 1 ) 1 + 2 + 1+2 ^ 

(_l)l+2-fl+3 ^ 

= +1 

= -1 

(. 1 ) 1 + 2 ^ 3+4 ^ 

With the terms written down in this order, the development reads 


dll 

a 12 

1 

X 

<*33 

<*34 


<*11 

<* I 2 

X 

<*23 

<*24 

<121 

<*22 


<*43 

<*44 


<*31 

<*32 


<*43 

<*44 


<*12 

X 

<*23 

<*24 

+ 

1 

<*21 

<*22 

X 

<* I 3 

<*14 

<*41 

<*42 i 

^ \ 

<*33 

<*34 

<*31 

<*32 

<*43 

<*44 

<*21 

<*22 

X 

<*13 

<*14 

+ 

i 

<*31 

<*32 

X 

<* I 3 

<*14 

<*41 

<*42 


<*33 

<*34 

<^41 

<*42 

<*23 

<*24 


By means of the Laplace development a determinant may evidently be 
evaluated in a variety of ways. One possible method of evaluation consists 
in repeatedly applying the simplest form of development given by Eqs. 1 7 
and 18. In the ^t step of this process, the development is given by the 
sum of n terms, each of which is the product of an element and an (/» — 1 )- 
rowed cofactor. In the second step, each of these cofactors is similarly 
developed, thus yielding for the determinant A a sum of n{n — 1) terms, 
each of which consists of the product of two elements and an (n — 2)- 
rowed cofactor. As the process is continued one recognizes that the final 
evaluation of A is given by the sum of n! terms, each of which consists of 
the product of n elements. 

The determinant is, therefore, a rational integral function, homo- 
geneous, and of the nth degree in its elements. In any term of the final 
evaluated form, the appearance of the product of an element with another 
element of the same row or column is not possible. This fact is readily 
appreciated by noting in the term ai 2 Ai 2 , for example, that the cofactor 
A 12 contains no elements of the first row or second column. Hence none 
of these elements can subsequently appear in a term containing ai 2 . The 
determinant is, therefore, a linear function in the elements of any one row 
or column. 
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5. Other methods of evaluation in numerical or functional 

FORM 

The evaluation of a determinant by means of the Laplace develop- 
ment, although useful for numerous analytic investigations, is a long and 
tedious process. The solution of simultaneous linear equations by means 
of determinants, as discussed in Art. 8, is usually found in numerical 
problems to involve a larger number of component operations than a 
systematic process of elimination. This situation is true even when the 
determinant and cofactors are evaluated by the method given in Art. 2, 
although this method parallels the elimination process in its essential 
steps. 

Alternative abbreviated methods of solving such equations are given 
in Arts. 7 and 11 of Ch. II. From a broader standpoint it is well to be 
familiar with numerous processes of evaluating determinants, so that the 
particular conditions of a specific problem may be met most expeditiously. 
In this regard the following remarks may also prove useful. 

Evaluations of two of the simplest cases by means of the Laplace 
development method are written down so that their resultant forms may 
be examined. 



On 

O12 


O21 

022 

On 

^12 

Oiz 

O21 

a22 

023 

O31 

O32 

Oz 3 


= (111022 O21O12 


^11^22^33 4* fll2®23^31 + ^13^21^^32 

”" fl3lfl22 ^ I3 O22O23O11 — O23O21O12 


[ 21 ] 

[ 22 ] 


By inspection of Eq. 21 it may be said that the value of a two-rowed 
determinant is given by the product of the elements on the principal 
diagonal less the product of the elements on the conjugate diagonal 
(lower left to upper right) as indicated in the following by arrows: 


principal 

(tiagonal 

conjugate 
diagonal ^ 


021 ^ 022 ' 


^x^negative product 


X*^poeitive product 


[23] 


This is called the diagonal product rule. It is applicable in extended fonn 
to the evaluation of a three-rowed determinant. Here there are three 
positive and three negative products, the positive ones being formed by 
elements on the principal and adjacent parallel diagonals and the negative 
ones by elements on the conjugate and adjacent parallel diagonals in a 
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manner which is more easily understood if the first two columns are 
repeated so that the arrows may continue straight, thus: 


negative products 


K -s N 

I <*11 <*12 ^'^12 I 

' V » 


<*21 )*^ ^23 ^*^21 <*22 ! repeated columns 

I ">» 'v , 

t<»3l ^<*32 ^<*33^ 


[24] 






nofiitive oroduciA 


The result is seen to check with Eq. 22. 

An extension of this rule does not yield the value of fourth and higher 
order determinants, as may readily be appreciated from the fact that the 
number of terms in the final evaluation must be n!, whereas the diagonal 
product rule yields only 2n terms. If » = 4, there remain 4 ! -- 2 X 4 = 16 
terms unaccounted for after all diagonal products have been formed. 

From a more comprehensive study of determinants, it is seen that all 
the terms in the final evaluation may be found by writing down the group 
of elements on the principal diagonal 

flu fl22 ^33 * • • ^nn 

and carrying out all permutations of the first subscripts, keeping the 
second subscripts fixed, or vice versa. In either case there are as many 
different products as there are permutations of n things taken w at a time, 
which is nl 

In this process, the algebraic signs of the various terms are controlled 
by the rule that all permutations formed by an even number of inversions 
of the permuted subscripts represent positive terms, all others being 
negative. Thus for « = 4 the possible permutations are 


Even number 

oj 

inversions 

Odd number 

of inversions 

1 

2 

3 

4 

2 

1 

3 

4 

2 

3 

1 

4 

2 

3 

4 

1 

1 

3 

4 

2 

1 

3 

2 

4 

3 

1 

2 

4 

3 

1 

4 

2 

3 

4 

1 

2 

3 

2 

1 

4 

3 

2 

4 

1 

3 

4 

2 

1 

2 

1 

4 

3 

1 

2 

4 

3 

2 

4 

3 

1 

2 

4 

1 

3 

1 

4 

2 

3 

1 

4 

3 

2 

4 

1 

3 

2 

4 

1 

2 

3 

4 

2 

1 

3 

4 

3 

1 

2 

4 

3 

2 

1 

4 

2 

3 

1 
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The twenty-four terms written with these as the first or second set of 
indexes, the fixed set being 1 2 3 4, and prefixed with algebraic signs 

according to the stated rule, represent the evaluation of the fourth-order 
determinant. 


6. Bordered determinants 

A determinant of given order may be transformed into a determinant 
of higher order without changing its value, as is readily seen by applying 
the ideas of the Laplace development to the following example: 




1 

“0 

Oil 

«2 

dll ai2 


0 

1 

Pi 

P2 

021 O 22 


0 

0 

Oil 

012 



0 

0 

Chi 

O 22 


[25] 


The elements aoi Ph ^2 i^ay have any values. The process can 

evidently be varied by placing the zeros to the right or above or below the 
rectangle containing the aikS. The resulting form is referred to as a 
bordered determinant. 


7. Products of determinants 


The product of two determinants of like order can be expressed as a 
single determinant of the same order. If the two determinants are initially 
not of the same order, one of them can be bordered. In the present discus- 
sion the determinants can, therefore, be assumed to have the same order. 

The procedure for obtaining the elements of the product determinant 
is best illustrated by means of a simple example. By the Laplace develop- 
ment the transformation of the following product is justified : 


AXB^ 


an 

^21 



^11 

^21 


bi2 

^22 


Oil CLi2 — 1 0 

^21 ^22 ^ — 1 
0 0 in bi2 

0 0 621 ^22 


[26] 


According to rule IV, Art. 1, the resultant fourth-order determinant may 
be modified in the following manner without changing its value. First 
the 6n-iDultiplied elements of the first row and the ii2-multiplied ele- 
ments of the second row are added to the corresponding elements of the 
third row, giving 


an 

ai2 

-1 

0 

^21 

O22 

0 

-1 

^11 

C21 

0 

0 

0 

0 

&21 

623 


[27] 


in which 


C\\ = an^n + ^1^12 

C21 = + <^22^12 


[ 28 ] 
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The object of this transformation is to produce zeros in place of the ele- 
ments 6 ii and 612 in the fourth-order determinant of Eq. 26. Now both 
the J 2 i-niultiplied elements of the first row and the 622 -wiviltiplied elements 
of the second row are added to the corresponding elements of the fourth 
row, giving 


where 


A 


an 

012 

-1 

0 

021 

022 

0 

-1 

Cl\ 

C21 

0 

0 

C\2 

C22 

0 

0 


[29] 


C\2 = (l\\h2\ + ^^ 21^22 
^22 == ^ 12^21 ^ 22^22 


[30] 


By the method of Laplace’s development the determinant of Eq. 29 is 
simply 


AXB 


C\\ 

C21 



C \2 

Ci 2 

^22 


C21 

C22 


[31] 


the second form being obtained by an interchange of rows and columns. 
Examining the expressions for the elements c,*, as given by Eqs. 28 and 
30, it is observed that they are formed by multiplying successive elements 
in the columns of the determinant A by successive elements in the rows of 
the determinant B and adding the results, the specific columns and rows 
involved being indicated by the first and second subscripts resp)ectively 
on Ci*. TTius Cn is formed from the elements of the first column of A and 
those of the first row of B; C 12 is formed from the elements of the first 
column of A and those of the second row of 5, etc. More briefly, the c’s 
are said to be formed by multipl 3 dng the columns of A by the rows of B. 
P* Since the individual values of the determinants A and B are imchanged 
by writing their rows as columns, it is clear that the value of the product 
determinant is unaltered if its elements are formed by multiplying either 
the rows or columns of A by either the rows or columns of B, The elements 
of this resulting determinant may, therefore, be formed in any of four 
different ways. Although the individual elements thus obtained are 
dffierent, the value of the resultant determinant remains the same. 

A straightforward extension of the method used in the above example 
shows that the process of forming the elements of a determinant repre- 
senting the product of two given determinants A and B follows the same 
general rules when A and B have any order. This result is summarized 
in the statement 


On 

•••am 

X 

hi 

• • • bln 


Cn • 

• Cm 

am 

• * ann 

bnl 

• * • bnn 


Cm • 

’ * Cnn 


[ 32 ] 
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with any one of the following four relations: 


Cik = O^ilbkl + ^*2^*2 + • • • + (^itfikn ' [33] 

Cit = diibik + ^*2^24 + • * • + ainbnk [34] 

Cik = (lubhl + + • • • + O^nfika [35] 

Cik = <^ublk + (hib2k + • • • + (^nibfik [36] 


On the right-hand sides of the last four expressions, the indexes i and 
k have fixed values for any one of the c*s. Since the first index refers 
to a row and the second to a column, it is recognized that the four repre- 
sentations for Cik above correspond respectively to multiplying rows of A 
by rows of B, rows of A by columns of J5, columns of A by rows of B, and 
columns of A by columns of B. 

8. Linear Equations 

One of the most important uses for determinants is in the solution of 
linear simultaneous equations. A set of n such equations involving n 
unknowns reads 


<*11^1 + <*12*2 + • 
®2J*I + ®22*2 + • ' 

■ + ai„x„ = yi 
• + a2„Xn = y2 \ 

+ ^n2^2 + * ' 



’ + ^nn^n ~ y» J 


[37] 


The object is to determine the values in terms of the values 

yi • • * yn and the coefficients a,* which are the elements of the determinant 
of this system of equations. This determinant is given by Eq. 1. 

Let the equations in the set (Eq. 37) be multiplied successively by 
the cofactors An, A 2 U + Subsequent addition yields the 
equation 

11 +^1^21 H +®nlAnl)^l 

-h (^12^ 11 +<122-^21 + • * • +<3tn2-4nl)^ 

“b (disAii +fl23*^21 ”1” * * * 'i‘^n3^nl)^3 


+ (aini4 11 +0211-421 H hO»»i4nl)Xn=-4iiyi+i42iy2H hAniyn [38] 

Here the coefficient of Xi is recognized as the Laplace development of the 
determinant A with respect to the elements of the first column, as given 
by Eq. 18 for i = 1. Similarly, the coefficient of X 2 is seen to be the 
Laplace development, with respect to the elements of the first column, of 
a determinant which is formed from A by replacing the elements of the 
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first column by those of the second column. This coefficient, therefore, 
is the determinant 


ai2 (l\2 <^13 
U22 ^22 ^23 


. ^n2 ^»2 ^n3 


O'ln 

^2n 




[39] 


which by rule VI, Art. 1, has the value zero. 

Similarly, the coefficient of Xs is equal to the determinant A with its 
first column replaced by the third column. This is likewise zero, as are the 
coefficients of all the remaining a s. Equation v?8 is, therefore, equivalent 


to 

Axi = Aiiyi + A2iy2 + • • 

* + Aniyn 

[40] 

whence 

Aiiyi + A2iy2 + * 
= ,1 

‘ • + Aniyn 

[41] 


In like manner the solution for X 2 may be obtained by multiplying the 
equations in the set 37 by the cofactors A 12 , ^ 22 , ’ • • An 2 y respectively 
and adding the results. The coefficient of a *2 then equals ^4, and the 
remaining ones are zero. Hence there results 


X 2 = 


Auyi + A22y2 4 - • • • + Arayn 

A 


[42] 


This result may be stated in general terms by assuming Eqs. 37 to be 

multiplied respectively by the cofactors i4 u, yl 2 t, i- A „k and adding the 

results. The coefficient of xi, then equals A , and the remaining ones are 
zero, so that 

Aityi + A2ky2 + • • • + i4„*y„ 

^ X* = 2 M 

For k = 1,2, - n, this represents the desired solutions. 

The numerator of Eq. 43 is recognized as the Laplace development of a 
determinant which is formed from A by replacing its fcth column by the 
coliunn of y’s appearing on the right-hand sides of Eqs. 37. Thus the 
result, Eq. 43, may be written 
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^11 

<^i,k^i yi 


din 

021 

d2,k-l y2 


d2n . 

dnl 

fln.fc-l yn 


Onn 


Oil di2 * 

Oln 



O 21 O 22 * 

* * d2n 



dnl dn2 ' 

• • dnn 



[44] 


A statement describing this form of the solution is known as Cramer^ s 
rule. 

A significant feature in the derivation of these solutions is a recognition 
of the validity of the relation 

+ ^ 1 - 421 : + • * • + dniAnk = |q \ [45] 


which justifies the step from Eq. 38 to Eq. 40. The companion relation, 
which is established in an analogous fashion, reads 

dilAkl + (li2Ak2 -f * • • + dxnAkn = |q \ [^^ 


For i ^ k this represents the Laplace development of a determinant whose 
ith and kth rows are identical. Equations 45 and 46 may he looked upon 
as an extension of the relations expressed by Eqs. 18 and 17 respectively. 
The solutions to the set of Eqs. 37 may be written in the form 


biiyi + bi2y2 + • • 

■ • + bmyn = Xi 

^ ^2iyi d" ^22y2 4" • ’ 

' * + b2nyn = -^2 1 

bniyi + bn2y2 + * ' 

* 4" btinyn ~ ^ 


in which, according to Eq, 43, the coefficients are given by 



[48] 


In this result it is significant to note the reversal of the subscripts on A,r 
as compared with those on ftr,. 

In case the elements of the determinant A fulfill the condition 

<Hk = [49] 

the determinant is said to be symmetrical about its prindpal diagonal. 
It is clear from rule VIII, Art. 2, that the minors and cofactors of A then 
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also have this property, that is, 

Aik = Aki [SO] 


and it then follows from Eq. 48 that the determinant of the system of 
Eqs. 47 is likewise symmetrical. In that case the subscript order in Eq. 48 
is, of course, unimportant. 

Equations 37 and 47 are mutually inverse systems. The one set repre- 
sents the solution of the other. Consequently by analogy to Eq. 48 the 
coefficients of Eqs. 37 may be written 


in which 


B = 


o,t = 

B,i 

B 

[51] 

*11 

• • • *i» 

[52] 

*nl 

* * ^nn 


is the determinant of the system of Eqs. 47 and Bki its cofactors. 

Evaluating the product ABol the determinants of the inverse systems 
of Eqs. 37 and 47 by substituting Eq. 48 into Eq. 34 gives 


Cik = 


+ <^i2Ak2 + • • • + (linAkn 

A 


[53] 


Reference to Eq. 46 shows that 

_ f 1 loti = k 

" i 0 loxij^k 

Hence for the determinants of these inverse systems 


*11 • 

•• *i™ 


ail • 

* <Iln 


1 

0 

0 •• 
1 

• 0 
• 0 

.... 


A 




*nl • 

* * ^nn 


anl • 

’ ' ^nn 


0 

0 • 

• 1 


The determinants have inverse values. 


[54] 


[55] 


9. CONDraONS FOR THE EXISTENCE OF SOLUTIONS 

The conditions under which a system of simultaneous equations such as 
the set 37 can have solutions may be seen from the form of these solutions 
as expressed by Eq. 44. For arbitrary values of yi • • • yn, a necessary and 
sufficient condition for the existence of these solutions is that the deter- 
minant A shall not be zero. If is zero, in general no solutions exist. 

They may, however, still exist in case the determinant in the numerator 
of Eq. 44 is also zero, as it is if the y’s satisfy the conditions 

y< = aiUii -H 020,2 rf ' • • + «ii®<n 


[56] 
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in which ai, • an are arbitrary factors. The column of y's in the nu- 
merator of Eq, 44 is then a linear combination of the other columns of this 
determinant and can, by repeated application of rule IV, Art. 1, be re- 
duced to a column of zeros. 

When the y’s are expressed by the relations 56, the Eqs. 37 can be 
rewritten in the form 


“ «l) + ^12(^2 ~ « 2 ) + ‘ + dlni^n — Otn) = 0 

021 (xi — ai) + 022(^2 — a2) + • • • + 02n(Xn — a„) = 0 


— ai) + ^In2(^2 — ^ 2 ) + * * * + CLnn{Xn — ^n) = 0 


[57] 


A special case of this sort occurs when all the y’s are zero. Then ai = 
af 2 = ••• = «„ = 0, and Eqs. 57 become identical with Eqs. 37 for 
yi = y 2 = * * • = >n = 0* This is called the corresponding homogeneous 
set of equations. For these, the Cramer rule as expressed by Eq. 44 yields 
the solutions in indeterminate form except when A 0, but then the 
solutions are all zero. They are spoken of as the trivial solutions because 
their existence is at once evident upon inspection of the homogeneous 
equations. 

Nontrivial solutions to the homogeneous set of Eqs. 57 exist only if the 
determinant is zero, but Cramer’s rule, Eq. 44, is of no use in determining 
them. In order to see how this difficulty might be overcome, it is helpful 
to consider the Eqs. 37 for the special case that one of the y’s, for example, 
alone is different from zero. Then, if it is assumed for the moment that 
the determinant is not zero and Cramer’s rule or Eq. 43 is applied, it is 
found that 




A 


[58] 


In this special case the ratio of any two unknowns is given by 

Xr _ Air 

X, Ait 


[59] 


which is independent of the values of both y,- and the determinant A. 
It may be inferred, therefore, that Eq. 59 holds also when both y< and 
A are zero. 

The correctness of this conclusion is demonstrated in a rigorous fashion 
in Art. 7, Ch. III. Meanwhile it is interesting to note that when the 
homogeneous set of equations has nontrivial solutions, these are not 
uniquely determined by Eq. 59, in which only the ratios of the unknowns 
are given. Any value can be assigned to one of them, and the remaining 
unknowns are then expressed in terms of this one. 
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The general conditions lor the existence of solutions may be discussed 
as follows. The fact that the inhomogeneous equations 37 have solutions 
only when the determinant is not zero simply amounts to stating that 
these equations must be independent. If one is a linear combination of 
the others (in this case the determinant vanishes^, then, speaking in 
physical terms, the data are insuflicient to yield an explicit answer. 

In case the right-hand members of the Eqs. 37 are zero and all the equa- 
tions are independent (^4 0), the system is overspecified from a physical 

point of \new. The situation is like a deadlock, and nothing can happen; 
that is, only zero values for the unknowns can satisfy the equations. If 
one of the equations is a linear combination of the others {A = 0), this 
one may be discarded and one of the terms in each remaining equation » 
for example, that with .x*„, transposed to the right-hand side. These 
(n — 1) equations may then be solved for the (// — 1) remaining un- 
knowns in terms of .v„, provided the determinant of this reduced set is 
not zero. If it is zero, this method fails, but so does the corresponding 
form of solution expressed by Eq. 59. 

Tliis kind of failure in the method of solution indicates that two inde- 
pendent sets of solutions exist, but it is difficult to obtain a clear 
picture of this situation without the aid of such appropriate geometrical 
interpretations as are given in Ch. III. The present discussion is completed 
in that chapter. The material of the following article, however, is helpful 
in summarizing some of the characteristics of the determinant which are 
pertinent to the present problem. 

10. The rank of a determin.\nt 

If in the determinant A , Eq. 1 , there exists among the elements of each 
column the same linear relation 

Cl\aik + Oi2<hk + • ’ * + = 0 (fc = I, 2, • • • «) [60] 

in which the as are arbitrary nonzero factors, the elements of any row 
are expressible as linear combinations of the corresponding elements of the 
remaining rows. If some of the factors ax- • an are zero, this fact still 
holds for the elements of some of the rows. By repeated modification of the 
determinant according to rule IV, Art. 1, any one of these rows can be 
reduced to a row of zeros. Hence it is seen that a determinant is zero if a 
relation of the form given by Eq. 60 exists in which at least one of the a^s 
is different from zero. 

Conversely, if the determinant is known to be zero, it is surely possible 
to find a relation of the form of Eq. 60, as is clear if Eq. 60 is written out 
for all the ^-values, thus 
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^11^*1 “b a2i<x2 + * • 

* + a^nlOLn = 0 



"b ^22^2 4“ * * 

• + an20tn — 0 

► 

[ 61 ] 

+ CL2n<X2 + * ‘ 

• + annttn = 0 



This is simply the homogeneous system of equations in which the deter- 
minant has its columns written as rows. It is called the transposed set of 
equations. According to property VIII, Art. 2, the transposed determinant 
is zero if the given one is. Hence, by the method discussed in the previous 
article, a set of nonzero a- values can be found to satisfy Eqs. 61. 

It is then also possible to tind a set of nonzero jS-values satisfying the 
relations 


file'll 4- 02^i2 “}"•••+ =0 (/ = 1, 2, ■ • • w) [62] 

since they are simply a set of solutions to the untransposed system of 
homogeneous equations. 

It follows that if the elements in a row of a determinant are expressible 
as linear combinations of the corresix)nding elements of the remaining 
rows, the elements in a column of this determinant are similarly ex- 
pressible. 

r When a determinant of order n is not zero, so that neither the relations 
60 nor 62 exist (except for zero a- and /(^-values), it is said to have the rank 
r = n. If the determinant is zero but at least one of its (w ~ l)-rowed 
minors is different from zero, one set of a- and it^-values exist which 
satisfy the Eqs. 60 and 62. The determinant is then said to have the 
rank r = w — 1. If the determinant and all its {ti — l)-rowed minors are 
zero but at least one of its (w 2)-rowed minors is different from zero, it 
has the rank r = ;/ — 2. In this case it is possible to find two independent 
sets of a- and /3-values satisfying Eqs. 60 and 62, and the homogeneous 
system of equations has two independent sets of solutions, as is discussed 
further in Ch. III. Corresponding statements for the condition that the 
rank is n — 3, w — 4, etc., are obvious. Finally, if the rank of a deter- 
minant is zero, all its elements are zero. 

The definition of rank facilitates the discussion of possible solutions to 
a system of linear equations. Thus the nonhomogeneous Eqs. 37 have 
solutions only when the rank of their determinant is w. The corresponding 
homogeneous equations have nontrivial solutions only if the rank is less 
than «, and they have p sets of independent solutions if the rank \sn - p. 
A more lucid exposition of this line of reasoning is found in the chapters 
immediately following. 
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PROBLEMS 

1. Determine the rank of each of the following determinants: 


(a) 

1 

2 

3 

4 (b) 

3 

9 

20 

18 

(c) 5 

0 

-2 

9 


2 

4 

6 

8 

8 

19 

40 

37 

0 

5 

-11 

7 


3 

6 

9 

12 

13 

20 

47 

34 

-2 

-11 

25 

-19 


4 

8 

12 

16 

20 

22 

59 

31 

9 

7 

-19 

26 


(d) 12 

-5 8 

(e) 

39 

24 

12 

5 (f) 6.5 

6.5 

4 

4 

0 1 


24 

21 

2 

2 6.5 

17 

8 

-4 

3 -4 


12 

2 

10 

3 4 

8 

4 




5 

2 

3 

1 




2. Transform each of the above determinants to the triangular form, thus finding 
their values and checking the answers to Prob. 1. 

3. For each determinant in Prob. 1 whose rank is less than its order, find relations 
of the form of Eqs. 60 and 62. 

4. Using determinants (e) and (f ) of Prob. 1 . write down corresponding sets of 
simultaneous equations, denoting the right-hand members by vi, vj, • • • as in Eqs. 37. 
Solve these equations by means of Cramer's rule. 

5. Repeat the solutions of the equations in Prob. 4 by means of a systematic 
elimination process. Compare the total number of multiplications and additions with 
those required in the solutions using Cramer’s rule. 

6. Evaluate the following determinant according to the pattern shown in £q. 20. 

2 14 3 

6-12-4 
3-2 5 1 

-5 64-1 

Repeat the evaluation through reduction to the triangular form and compare the 
total numbers of multiplications and additions required in the two methods. Derive a 
formtila giving the total numbers of multiplications and of additions required for the 
evaluation of an nth order determinant by the method involving its reduction to the 
triangular form. 

7. Determine the solutions to a set of equations (like Eqs. 37) having the ac- 

0.5 0.5 0.5 0.5 

-0.866 0.289 0.289 0.289 

0 -0.258 0.408 0.408 

0 0 -0.707 0.707 

companying determinant. Compare the set of equations representing the solutions 
with the given equations and note any obvious mutual relations existing between 
these two sets of equations. 

8. Given the two sets of equations 

n • 

E aikXk ^ yi (i * 1, 2, • • • n) 

k^l 


L bk,y, --Xk « 1,2, •••«) 


and 
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which are solutions of each other, show that the corresponding determinants A and B 
have inverse values; that is^AB = 1 . A proof may be based upon the rule for forming 
the product of two determinants. 

9. In the following special nth order determinant 


Dn 


a 1 0 0 ••• 0 

1 a 1 0 ••• 0 

0 1 a 1 ••• 0 


0 0 ••• 0 1 a 


all the elements of the principal diagonal are equal to a; those on the diagonals im- 
mediately above and below the principal diagonal are unity, and all the rest are zero. 
Derive the following recursion formula: 

Z?n — OcDfi—i Dh_2 


applicable for n = 1 , 2 , • • • with the definitions: Z>o = 1 and Di - 0. From this 
recursion formula obtain an explicit expression for the determinant of order n which 
reads 


smh(n-fl)7 u-i® 

Dn = r-r » with 7 = cosh ‘ - 

smh 7 2 


10. If the first and the last elements on the principal diagonal of the determinant 
in Prob. 9 are replaced by a 2, show that the resulting determinant has the value 

Dn = sinh (n — 1)7 • sinh 7 

while if only the first or last of these elements is a/ 2 , the value is given by 


Dn = cosh ny 


11. Consider the determinant 
di 1 

-1 d2 

D = I 0 -1 

0 0 


0 0 0 0 
10 0 0 
(/3 1 0 0 
-1 d4 1 0 


K(dir--dn) 


and show that this function • • • dn), called a simple continuant ^ possesses the 
recursion formula 

A(dl, • * * dn) =* dnK{dif • • • dn— 1) + Kifily * * * dn— 2) 


with 

A(di) = di, and K{0) = 1 

12. Denote by Dn the cofactor formed through canceling the first row and column 
in the determinant given in Prob. 11. Make use of the results of Prob. 11 to show that 

D K(.du---dn) . 1| I 1.1 , Li I ■■■ 

Dll 1 <^» Uc 

which is known as a continued fraction, 

13. Show that 

Kidxr-dn) « K(dnr-di) 
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and that the recursion formula given in Prob. 1 1 may alternatively be written 

K{du • • • • • • rfn) + K{d^, • • • (fn) 

14. Show that the partial derivative of a determinant with respect to one of its 
elements equals the cofactor of that element. In symbols: dA ddtk = /I, it. 

15. Consider the cofactors .4,* and Atr corresponding to the elements a,k and a,r 
in the same row of a determinant .1. Show that the sum (.4,* + A,r) is equal to 
( — times .4,jt with the column involving the elements a,r replaced by one 
with elements (a^r + (Jsk). or to (-!)*-'"+» times A»r with the column involving the 
elements -4^* replaced by one in which the elements are (a si- + 

16. Using the iype of reasoning involved in the previous problem, show that the 
fourth-order determinant may be written as the following sum of two third-order 
determinants: 


(011022 

— <Jl 2 <J 2 l) 

023 

024 


O2I 

O22 

(O13O24 

— 014023) 

{anOz 2 

— Oi 2 <l 3 i) 

O33 

O34 


O3I 

O32 

(O13O34 

— O14O33) 

(til 1042 

— O12O41) 

O43 

O44 


O4I 

O42 

(O13O44 

- O14O43) 


or as a variety of obvious modifications of these forms. 

17. E.xpress in determinant form the condition that the three straight lines defined 
by 

dnx -f di2y + oi3 = 0 

anx + a22y + fl23 = 0 

OnX + (232y + 033 = 0 

shall intersect at a common point. 

18. In the JVK-plane the origin O, and the two points P(xi,yi) and P(.t 2 ,y 2 ) 
determine a triangle. Show that the area of this triangle is expressible by means of the 
determinant 

1 xi yi 

2 X2 >'2 

19. Using the result of the previous problem show that the area of a triangle 

determined by the points ( 0 : 2 , >’ 2 ) and (X 3 ,y 3 ) is expressible as 

1 xi yi 1 

2 ^2 >'2 1 
^ xz yz 1 

Write the condition for which these three points lie on the same straight line. 

20. ax + by + cz + d =0 is the equation of a plane. Its intercepts on the co- 
ordinate axes are: X = ~d y = -d 6, z = -d c. Let n denote the normal from 
the origin to the plane. Its direction cosines are: 


cos («,x) = 



cos (n^) » 



Since the sum of the squares of these cosines equals unity, one has 

d 


n 


y/a^ i- 
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Consider a point Po(3Co,yo;8o) for which axo + byo czq + d Dq, Subtracting this 
equation from the original one gives a(x - xq) -f b(y - yo) + c{z ~ Zq) + Z>o = 0 , 
from which it is clear that the length of the normal dropp^ from the point Pq to the 
plane is 

Do 

no == —============== 

Va^ + -f 


These results and that of the previous problem are to be made use of to show: (a) 
That the equation of a plane passing through the three points 
P2(x2,y2^2) and may be written in the form 


D(x,y,z) = 


X y z I 
xi yi zi 1 
X 2 >’2 Z2 1 
-Va ya Z3 1 


= 0 


(b) That the cofactors of the first three elements of the first row, that is, 


yi 2i 1 


Zi Xi 1 


Xi yi 1 

y2 22 1 

f 

Z2 X2 1 

> 

X2 y2 1 

ya 23 1 


Z3 X3 1 


X3 yz 1 


are equal to the projections of the area of the triangle P1P2P3 upon the F-, Z-axes 
respectively and that the square root of the sum of the squares of these cofactors 
equals the area of this triangle, (c) That Do = Z)(.ro,yo,so) divided by this square 
root equals the normal distance of a point Po(.vo,yo.2o) from the plane, and hence that 
the volume of the tetrahedron whose vertexes are the points P0P1P2P3 is equal to 
one-sixth the value of the determinant Do- 

21 . Three planes passing through the origin are represented by the equations 

(Jix -}- (i2y + U3Z = 0 
blX -h ^2y + ^32 == 0 
cix -I- Coy -f- r3S = 0 


Express in determinant form the condition for which these planes intersect in the 
same straight line and find the expressions for the direction cosines of this line. Given 
any two planes, what are the direction cosines of their intersection? 

22 . Write the following determinant as a po’ vnomial in X: 


P(X) 


dll — X di2 flia 

U 3 I ^22 ~~ X d23 

d3i d32 Uaa ~ X 


and obtain expressions for the coefficients of the polynomial in terms of the determi- 
nant A and its cofactors. Indicate the forms of these expressions for an «th degree 
polynomial. 

23. Prove that 


(a) 

(Oll + ftu) CIS Cl, 


dll C12 • • • Cln 


hii Ci% • • • Cln 


(asi + isi) Css csn 

sz 

d21 C22 • • ' C 2 n 


621 C22 • • • Cfn 


(dnl 4 * bnl) Cn2 • • • 


Otnl ^2 ’ • * Cnn 


bnl Cn 2 ••• Cnn 


(b) If Cik = ; = \-l 
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dll 

012 

Ol3 


Oil 

^12 

bit 


*11 

O12 

*13 


bn 

bit 

an 

021 

022 

023 

- 

021 

^22 

btt 

— 

6,1 

O22 

btt 


bn 

btt 

o»a 

081 

O32 

O33 


Osi 

632 

btt 



O32 

bit 


bn 

bit 


On 

O12 

613 


On 

bit 

Ol3 


bn 

O12 

dit 


bn 

bit 

b» 

021 

O22 

bss 


021 

btt 

023 

-f 

bti 

O22 

028 

- 

bn 

btt 

bn 

081 

O32 

bss 


031 

bst 

0S3 

i 

1 

bn 

0S2 

O33 


bn 

bit 

bn 


dik = dik ~ jf>ik i,k ~ 1,2,3 
\dik\ = [fin complex conjugate of |ca| 


Cik - dik -\-jbtk 1 , ^ = 1 , 2 , * • • » 7 = \/^ 

Prove that Ic^l is a complex number with the following law of formation; 

(a) |cal is equal to the sum of 2 " determinants of order w, 2 "“* being real and 
2 ""* pure imaginary. 

(b) The real determinants have an even, or zero, number of “ b ” columns. The 
imaginary' ones have an odd number of “ b ” columns. 

(c) If m is the number of “ b columns in any determinant in the expansion, the 
sign and complex character of the determinant are given by 7 ^. For a given value of m, 
there are several determinants which contain m “ b ’* columns; their number is given 
by 

n(n ~ \)(n - 2 ) » - (n - m + 1 ) 


(d) By using the above properties show incidentally that 

1 i) . ~ ^)(n - 2) 

1 H- w H — 1 


2 ! 


3 ! 


• n ! 

+ — , - 2 " 
n 1 


25. If yik = yikU), for i,k = 1,2, •••«, are single- valued, differentiable 

functions of the independent variable /, show that 


b«l = 



y'li 

yi 2 • 

•• yin 


yii 

y'i 2 • • • 

yin 


yii 

yi 2 • * 

y'in 

* 

y'ii 

3^22 • 

•• y 2 n 

+ 

yii 

y '22 • • • 

y 2 n 

+ ••• + 

yai 

y 22 •• 

y'tn 


/nl 

ynt • 

ynn 


ym 

y'nt • • • 

ynn 


ym 

yn2 • • ■ 

- y'nn 



y'li 

y'lt 

• • • y'ln 


yn 

yit 

• • • yin 


yii 

yi2 • • • yin 


y2i 

yn 

• • • y2n 


y'21 

y'ti 

' • • y'u 

+ ••• + 

y2i 

ytt • • • y2n 


ym 

y»i 

* ’ * ynn 


yni 

ym 

* * • ynn 


y'ni 

y'nt • • • y',. 


in which 


yik 


dya 

dt 


26. In terms of the n independent functions 

yjt *^*(0 for^ « l,2,--*ff 
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(differentiable up to and including the nth order), construct the following determinant 
(the so-caUed Wronskian of those functions) : 


>1 


>3 

**• yn 



yt 

/a 

• • • y'n 


y". 

y'a 

y"s 

• * • 

yt<*> 




• • • y,'”-*' 



dt 




If these functions are connected by a linear relation of the form 
A\y\ -h Aiy2 -h A^yz + • • • + Anyn = 0 

in which the i4's are constants, show that the above determinant vanishes identically. 
HitU. Differentiate the linear relation successively n - 1 times so as to obtain 

Aiy\ + • • • + Any'n 


-f ‘ 


= 0 

+ Anyn^^-^^ - 0 


Together with the original relation, one then has a set of n equations. From these, the 
value of any function y*, for example, yi, and its first (w — 1) derivatives can be 
obtained. Substitution into the Wronskian, foUowed by an expansion according to 
columns, leads to the desired result. 

27. Using the determinant, A, as defined in Prob. 26, show that 


>1 

>2 

ys 

yn 


/2 

y'a 

y'n 



yjt.-® . . . 

■ y»'”-» 

yi<"> 


yjC.) 

yM 


Him, Use the result of Prob. 25 and observe the resulting structure of the rows. 

28. If (with reference to the situation given in Prob. 27) there exists a set of n 
relationships (differential equations) of the form 

Mr'"' + + • ’ ' + = 0 


for 


r = 1,2, • 


• n 


in which the coefficients ^o, pu Pty--- pn are constant or variable, show that 


(a) 



h 

^0 


A 


(b) 




in which Ao is the integration constant. 

Him, Give r the values 1, 2, • • • « and obtain, from each equation, the value of 
y/»>. Substitute these values into the last row of the expression in Prob. 27. 

29. Given 

Xk “ 3CJk(0 forife » l,2,***n(asystemofnunknownfunctionsof0 
yk “ >4(0 for ife « 1, 2, • • • n (a system of n known functions of i) 


and 


Oifc for t, ib = 1, 2, • • • n (a collection of w- constants) 
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These quantities are related by the following system of first-order, first-degree, linear 
equations 

4- * • * + (hjsin = y* for /r = 1, 2, • • • ff 


Show that a solution of this system (the particular one) is given by 
A^fy, dt -1- Azk J*y2dt -i 


Xk 


iJik\ 


in which the A ,jt’s are cofactors of the determinant |<Ja | 
30. Show that 


1 1 I 
1 2 2 - 
1 3 3-’ 


1 

2«-i 

3n-l 


= 1 ! X 2 ! X 3 ! • • • - 1) ! 


1 n ;i- • ‘ • > 1 ”“^ 


Hint. Reduce the determinant to the diagonal form. Use Barlow’s tables of squares 
(pages 202 to 206) for the powers of integer numbers and observe the law of forma- 
tion. 

31. 


ujc = • • • Of,.) ior k = 1, 2, • • • « 


are n single-valued differentiable functions of the independent variables .ri. x^, • • • .tn. 

The “ Jacobian ” of these functions is, by definition, the following functional 
determinant: 


/U iU2 ' ’ 
\XiX2 - • • XnJ 


dui 

du\ 

dui 

dxi 

dx2 

OXn 

dti2 

dU2 

dU2 

dxi 

dx2 

dXn 

dUn 

du„ 

dUn 

dx, 

dX2 

dx„ 


Suppose the variables xi‘ • • Xn are changed to the new- independent variables 2i • • • 
according to the equations of transformation 

Xk = Xk{zi ---Zn) for ^ ^ 1, 2, • • • n 


The original Uk functions are now 

Uk = Ukizi *••«„) for ^ « 1, 2, • • • n 


and their Jacobian with respect to the variables Zi • • • Zn, for example, /i, differs from 
J only in that the variables x are replaced by corresponding z's. 
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(a) Show that the above Jacobians are connected by the relation 


dxi 

dxi 

dxi 

dzi 

dZ2 

dZn 

dX2 

dX2 

dX2 

dzi 

dZ2 

dZn 

dxn 

dXn 

dXn 

dzi 

dZ2 

dz 


(b) Extend the above result so as to consider subsequent transformations of the 
form 

= s/ri, r 2 , • * * for; =1 2, • • • « 

Tp = rp(/i. / 2 , • * • tn) for/> = 1, 2, * • • M 


(c) What happens with the last Jacobian when any intermediate functional 
determinant is identically zero? Hint. Apply the rule for differentiation which reads: 


dur __ dur dxi 
dz, i = 1 dxi dz. 


r,s = 1, 2, • • • « 


32. Let 

Qjk = .r 2 , - • Xn) for;, k = 1, 2. • • • w 

be a system of ;r differentiable functions of the independent variables xi, X 2 * * • Xn. 
Through the introduction of a new set of independent variables xi, X 2 , * • • Xn by means 
of the functional relations 

Vfc = Xjt(xi, X 2 , • • • Xn) for ^ = 1, 2, • • • n 


the system of functions ayjt in the old variables goes over into the transformed system 
Cijk in the new variables. 

Accepting the result that goes over into ajk in accordance with the law of 
transformation 




^dxk 

j=lk^\^^^dXpdXq 




prove that the determinant |a;it| is transformed as indicated by 




in which 


dxi 

dxi 

dxi 

dSi 

dSi 

dSn 

dX2 

dX2 

dX2 

dSi 

dSi 

dSn 

dXn 

dXn _ 

dXn 

dSt 

dii " 

“ din 
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33 . The expression for the three-dimensional volume element in a general system 
of co-ordinates is given by 


in which 

gjk = 2 V = 1, 2, • • • « _/ = 1, 2, • • • n it « 1, 2, • • • « 

dxj dik ^ » 

If the co-ordinate system is orthogonal, the gjk system has the property 

I 5 *^ 0 fory = k 
1=0 ioTj 9 ^ k 

Check the values of \"!j?/x| for the different co-ordinate systems and laws of co- 
ordinate transformation given in the following table: 


Name 


Cartesian 


Circular 

cylindrical 

Elliptic 

cylindrical 

Parabolic 

cylindrical 


Bipolar 

c>"lindrical 


Spheroidal 


Spherical 


Equations of Transformation 

xi = 2 i 
JC2 = X 2 
, fa = ^3 

.Ti = 2i cos f J 
jf2 = xi sin 22 
, xa = 28 

xi - c cosh 2i cos ft 
X2 = c sinh 2i sin 22 
\X3 = 23; c = const, 
i jTi = ^(2r ~ V) 

X 2 = 2 i 22 
[ X3 = 23 

; _ a sinh 2i 

j cosh 2i — cos 22 
a sin 22 

' cosh 2i — cos 22 
i X3 = 23; <2 = const, 
f xi = c cosh 2i cos 22; c = const. 
i xt - c sinh 2i sin 2 % cos 23 
[xz - c sinh 2i sin 22 sin 23 

| 2i = 2i cos 22 sin 23 
22 = 2i sin St sin 23 
28 = 2i cos 28 


vlgy^l 

1 

2 i 

c^(cosh“ 2 i — cos* 22) 

(2i* -h 22*) 

a* 

(cosh 2 1 - cos 22)* 

f^Ccosh* 2i - cos* 22 ) sinh 2i sin St 
2i* sin 28 


34 . Given the multiple integral 


I 


ff f 


F(xi, **,••• * 1 .) dxi dxf- dxn 


in which Xi, xt, ■ ■ - Xn are the independent variables. If new variables Zi, Za, ■ ■ ■ Z, 
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are introduced in accordance with the relations 

Xk ^ Xkiiiy 52, * * * 5n) for ^ = 1, 2, • • • n 
it can be shown that the above integral becomes 

/ = J* J J JF{i\,xt, -in)d3idif-d,in 

in which J is the determinant given in Prob. 32. 

Compute the value of the determinant J for each set of transformation functions 
given in the second column of the table in Prob. 33. 

35. Given the following system of m + « linear equations involving the m n 
unknowns jcx for 7 = 1, 2, • • • », and gp for p = 1, 2, • • • w, with n > m: 

#» “w P =m 

Z - «^ + L =0 for X = 1, 2, • • • « 

M-1 P=1 

M =n 

=0 for p = 1 , 2, • • • w 

(a) Write in determinant form, according to the above order of these equations, 
the condition for the existence of nontrivial solutions. 

(b) Show that this determinant is a polynomial in e of the degree » — w. 

(c) Using Laplace’s development with respect to the last m rows, show that the 
total number of f«th-order minors which can be formed is given by 

(n + m){n + m - l)(n + w - 2) • • • (« + 1) ^ ^ 

— ym < n) 


and that only 


n{n — l)(w — 2) • • • (« ~ m + 1) 


of these are not necessarily zero (the rest being identicaUy zero). 



CHAPTER II 


Matrices 

1. Linear transformations 
In Art. 8 of Ch. I, the system of linear equations 

^11^1 + + • * * + ^^In-Vn = Vl 

<hlXl + <222^2 H h a2n.V„ = >'2 


OnlXi + (i„2^2 H h annXn = >',1 

is considered to relate a set of unknown quantities Xi • • • to a set of 
known quantities >'1 • •• An alternate point of view, which plays an 
important part in the analysis of physical problems, is to look upon these 
equations as relating a set of given variables .Vi • • • .v„ to a new set of 
variables yi • • yn- The equations are said to transform the old variables 
into new ones. In this sense the set of Eqs. 1 is spoken of as a linear 
transformation. 

The transformation is characterized completely by the coefTicients 
Oik. Since not only the values of the coefficients but also their relative 
positions in the equations arc significant in this respect, a symbolic form 
for the characterization of the linear transformation is given by means of 
the rectangular array 


ail 

a\2 • 

• • ain 

021 

022 

■ ■. [ 2 ] 

aril 

an2 ’ 



which is called the matrix of the transformation. 

Offhand, it might be thought that the determinant could serve this 
purpose, but here it must be recalled that the determinant is a funetion 
' of tlie coefficients and not merely a symbolic representation of them in 
their relative orientations. The determinant has a value; the matrix, 
r on the other hand, is merely a picture and has no value other than that 
/ which it conveys by its structural composition. For the moment it has but 
one reason for existence, namely, that it is easier to write down than the 
system of Eqs. 1, yet contains essentially the same information. 

In outward appearance the determinant differs from the matrix only 
in that the latter is enclosed in square brackets whereas in the determi- 
nant the elements are enclosed between vertical lines. In this book, an 
upper-case script letter is used to denote the matrix and the upper-case 

SO 
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italic letter denotes the corresponding determinant. Thus the matrix 2 
is denoted by Ct and the corresponding determinant by A, 

Just as determinants facilitate the study of linear equations and re- 
lated problems, so the algebra of matrices is justified in that it facilitates 
the manipulation of several sets of linear transformations. The rules of 
matrix algebra arc chosen with this end in view and are otherwise quite 
arbitrary except that they must, of course, be self-consistent. 

A matrix docs not necessarily have the same numbers of rows as 
columns. Its form is therefore more flexible than that of a determinant, 
which must always be square. If the matrix has m rows and n columns, 
it is referred to as having the form m by w, or it may simply be called 
an (;««)-matrix. When m equals w, the matrix is said to be of order 
w. In the extreme cases in which the matrix consists merely of a single 
row or a single column, it is referred to as a roTi; matrix or a column 
matrix respectively. 'I’hese special forms occur frequently enough to 
warrant the use of some abbreviated notation to distinguish them. 

The notation used in this text for a row matrix is 


= [«i 


«2 




[3] 


A column matrix is indicated as shown by the equation 




^2 


[4] 




At times it becomes necessary to have the abbreviated symbol for a 
• matrix indicate the number of rows and columns involved. A matrix 
having m rows and n columns is w'ritten 




ail • • ain 


’ ’ * ^winj 


[5] 


A less specific notation, however, is sufficient when the number of rows 
and columns either is clear from the discussion involved or is immaterial^ 
If a rule for the multiplication of matrices is properly chosen, the linear 
transformation, Eq. 1, may be written as a matrix equation. One of the 
four alternative rules for forming the product of two determinants (dis- 
cussed in Art. 7, Ch. I) may be used for this purpose. It is conventional 
to choose the rule expressed by Eq. 34, Ch. I. Thus in order to form 
the product C of two matrices Q and SB, as indicated by the equation 

a X SB = e [6] 
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the elements in C arc obtained by multiplying the rows of (i by the 
columns of SB, as described in detail for the multiplication of determinants. 

With the adoption of this convention, the matrix equation replacing 
Eq. 1 reads 

fan ^12 • • • din 
U 21 022 * * * d2n 


L^nl dn2 * • * dnn^ 

which may be abbreviated as 

Q Xx] 

The element yi, which is the (l,l)-element in the column matrix y], is 
obtained by multiplying the elements in the first row of Q by the corre- 
sponding elements of the first column (the only column) of x] and adding 
the results. This gives 

dnXi + ai2X2 + - • + ainXn ^ yi [ 9 ] 

which is the first equation in the set 1. Similarly, y 2 which is the (2,1)- 
element in the matrix y], is formed from the second row of Of and the 
first column of x], thus 

^21^1 + 022 X 2 -h • * • 4- a2nXn = >’2 [lO] 

This is the second equation in the set 1. The remaining equations in this 
set are similarly contained in the matrix equation 7 or 8. 

It should be noted that the column matrix y] has no elements for the 
position (1, 2), (1, 3) • • • or (2, 2), (2, 3) • • • , etc., because it has only 
one column — the first column. For the missing elements, or zero elements 
in y], the matrix product expressed by Eq. 7 automatically yields zeros 
according to the adopted product-forming rule because the column 
matrix x] has no second, third, • • • , etc., columns; that is, the elements 
in these missing columns of the matrix x] are all zero. 

The present argument requires for completeness a statement of the 
rule that two matrices are equal only when all corresponding elements 
are equal. Specifically, if two matrices 9 and St with elements pr» and 
qrt are to be equal, that is, if 

iP = a [11] 

it is necessary that 

Pf ~ [12] 

for all values of the indexes r and s. For example, the left-hand side of 
Eq.9 is the (l,l)-element of the prdduct of 0- and *] in Eq. 7, and yi. 





X2 


y2 





[7] 


= y] 


[ 8 ] 
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the right-hand side of Eq. 9, is the (l,l)-element of the matrix y] in 
Eq. 7. This rule for the equality of two matrices is, hgwever, so obvious 
that it hardly needs emphasis. 

It should be clearly recognized that the objective of obtaining a matrix 
equation equivalent to the transformation 1 could have been met just 
as well by the adoption of any one of the other three methods of forming 
elements in the product matrix according to the rules of multiplying 
determinants and by suitable modification of the form of the matrix 
equation. The choice of the rule by which the rows of the first matrix 
in the product are multiplied by the columns of the second is arbitrary, 
but once it is adopted, consistency requires that it be looked upon as the 
rule. There are no alternatives as in the multiplication of determinants, 
because the alternative procedures, if applied to the example illustrated 
by Eq. 7, do not all yield the same final set of equations. 

That the adopted rule for matrix multiplication is not compatible with 
the commutative law in forming algebraic products is likewise apparent. 
Thus if Q and are two matrices, the product (J X S is different (and 
has a different manipulative significance) from the product X Q. 
Thus in general 

Q X SB 5^ X ff [13] 


in matrix multiplication. 

The utility of being able to write a transformation in the compact 
matrix form is appreciated when several transformations are to be carried 
out in tandem. In addition to the transformation 1 from the variables 
Xi • • • Xn to the variables yi • • • yn, it may be necessary to transform 
subsequently to a set of variables si • • • Zn by means of the equations 

huji + 612^2 H + bmyn = Zi 

^2iyi + h2y2 + • • • + 32nyn = 22 

hn\y\ + bn2y2 H + Knyn = 2n 



with the transformation matrix 




[15] 


With the additional definition of the column matrix 



[ 16 ] 
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this transformation takes the compact form 

X v] = z] 


[17] 


The direct transformation from the variables Xj • * • to the final 
variables is obtained by substitution of the expression for y] 

according to Eq. 8 into Eq. 17, which gives 

X fl X .v] = z] [18] 


or, more compactly, 


C X .v] = z] 


[19] 


in which the resultant transformation matrix C is given by the product 

e = X (? [20] 


The convenience of the matrix method of handling transformations 
becomes clear if it is contrasted with the additional labor involved when 
a specific example of the above transformation is carried through by 
means of the usual substitution process. The individual transformations 
may be assumed to be the following simple ones: 

UiiXi -f = yi I r2|-| 

^21-^*! + ^ 22-^'2 = >’2 J 


and 


+ ihiya — 1 

^2iyi + ^ 22}' 2 = ^2 J 


[ 22 ] 


Substitution of Eqs. 21 into Eqs. 22 gives 

b\i(a\iXi + <2i 2-^’2) + ^12(<^21-^’l “b <^22'^2) = ] 

^2l(^ll*^‘l + <^12^*2) "b ^22(^21-^*1 + ^ 22 ^ 2 ) = -2 j 

Multiplying out and collecting terms with Xi and .V 2 , one obtains 

(^11^11 + ^^12^21)^1 4 - ( bnai 2 + 612^22)^2 = 2i 1 

(^ 21^11 + ^ 22 ^ 21)^1 + {b2iai2 + b22(^22)^2 = ^2 j 


[23] 


[24] 


The coeflScients in these equations are recognized as the elements of a 
matrix formed from the product 


^11 bi2 ^ flu (112 

J^2l b22j L^21 ^22^ 


[25] 


The ease of manipulation and circumspection resulting from the use 
of the corresponding matrix equations is proved even more conclusively 
in the applications discussed in Ch. IV. Meanwhile it is necessary to 
study the rules governing various additional algebraic manipulations 
with matrices. 
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2. Addition of matrices 


By the sum of two matrices (? and is meant a resultant matrix C 
whose elements are equal to the sums of corresponding elements of Q. and 
JB. Explicitly, if 


a = 


then 


'an • • • 

^In 

a = 

1 

c 

1*^ 





J^tnl * • * 

^inn_ 


J^ml * ■ * ^wnj 

] = e = 

* (a,, 

1 + ^ii) • 

* (<^ln 4" hin) 


_ (^ml 

4 ^ml) • 

iflmn 4“ ^mn 


[26] 


[27] 


Similarly, the clifTerence (2 — yields a matrix whose elements are 
the dilTerences of corresponding elements of Q and ffi. 

Evidently the matrices Q and ffl should have the same number of rows 
and columns, or the missing rows and columns of one may be regarded 
as composed entirely of zeros. 


3. Multiplication by a factor 

If a matrix is multiplied by a factor, each element of the matrix 
becomes multiplied by this factor; thus 


“<711 • 



'kail ■ 

• kain 

m 1 






As is clear from the fact that if the factor k is zero, the matrix must be 
zero, wliich is true only when all elements of the matrix are zero. 

It should be observ ed that this rule is distinctly different from the 
corresjwnding one for determinants, in that the determinant becomes 
multiplied by a factor when the elements of only one row or coliunn are 
multiplied by this factor. 


4. Multiplication of matrices 

The operation of multiplication is introduced in Art. 1 above, but 
several additional remarks are necessary to complete the discussion. 

*"The individual matrices entering into the product Q X may not have 
the same number of rows as columns, but it is dear from the manner in 
which this product is carried out that the number of columns in d (the 
number of elements in a row of (J) must equal the number of rows in S 
(the number of elements in a column of SB); otherwise some of the 
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products of elements in the rows of Q. with corresponding elements in the 
columns of ffl cannot be completed, for want of either one or the other of 
the corresponding elements. For example, when expansion of the follow- 
ing product. 


an ai 2 Uia"! ^ fftn ^12! 

a^i 022 ^23 J L^2i ^22 J 


[29] 


is attempted, it is found that there are no elements in the matrix fB by 
which the elements Uia and 023 may be multiplied. These matrices are 
said to be noticonformable. They become conformable when a third row is 
^added to the matrix SB. This row may, of course, consist entirely of zeros. 

Curiously enough, conform ability does not require that the number of 
rows in fl be equal to the number of columns in SB. Thus the formation 
of the product 


dn 

.^21 


aj2 

022 


dVA 

023 



[30] 


is considered straightforward, it being tacitly understood that the matrix 
SB is interpreted as equivalent to 


bn 0' 
621 0 
.^31 0 _ 


[31] 


but never that it is equivalent to 

"0 bn 
0 ^21 
.0 631 


[32] 


It is useful to observe that the matrix formed from the product Q X SB 
has as many rows as fl and as many columns as SB. For example, the 
product 30 yields a matrix with two rows and one column. In general, 
if the matrices are denoted more specifically as [a^n] and [fcnp], then 


[^mn] X [^np] [^mp] 


[33] 


These matrices are conformable because Q has n columns and SB has n 
rows. 

Similarly the product of three conformable matrices reads 

[^mn] X [6np] X [Cpq^ = [^«] [34] 


Here the resultant matrix has as many rows as the first matrix and as 
many colunms as the last one entering into the triple product. ^ 

To carry out the product of three or more matrices such as Q X SB X C, 
the opponent product G X SB may be evaluated first and the resultant 
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matrix multiplied into C, or G may be multiplied by the resultant of 
X C. In symbols this procedure is stated by the equation 

G X SB X e = (G X £B) X e = G X (S X e) [35] 


In other words, the associative law holds in matrix. mul tipli cntioD- 
THis faci; may prove useful in minimizing the amount of labor involved 
in carrying out a multiple product. Observe that the multiplication of the 
two matrices 


[flmn] X [f*np] 

involves the operations 

m X n X p multiplications 
m X {n — l)x p additions 
Consequently a triple product in the association 
([®mn] X X [Cp?] 

involves the operations 

m X n X p + m X P X q multiplications 
tnX{n — l)Xp + tttX{p — l)Xq additions 
and in the association 

\/hnn\ X ([inp] X [fpj) 


[36] 

[37] 

[38] 

[39] 

[40] 


the operations are 

nXpXq + mXnXq multiplications 
mX(^ — 1)X? + >»X(m — 1)X? additions 

As a numerical illustration let it be supposed that w = 1, « = 6, 
/» = 1, and 9 = 4. Then the operations 39 are ten multiplications and 
five additions, whereas the operations 41 amount to 48 multiplications 
and 20 additions. 

Since, as pointed out earlier, the commutative law does not hold, the 
order in which the matrices enter into the multiple product must be 
carefully observedyin Eq. 35, for example, the component product 
(G X ®) is /'w/thmtiplied by C, and the component product (£B X C) is 
/iremultiplied by G. 

In this connection it is to be noted that the distributive law is valid 
also, with the caution that attention must again be gjven to the dis- 
tinction ^tween premultiplication and postmultiplication. Two casc| 
are illus^ted in the equations 

(G + ffi)xe = Gxe-l-tBx€ 


[42] 
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and 

ex((i + ffl)=exa + exSB [43] 

It is useful to obser\'e that since the rule for matrix multiplication 
agrees with that for the multiplication of determinants, the determinant 
of the multiple product 

axffixCx - x9 [44] 

has the value 

^X5XCX - XG [45] 


5. Some special forms of square matrices 


A square matrix of the form 

Rn 0 


0 0 


0 • • 0 
0 • • • 0 


0 • • • dnn 


[46] 


in which all the elements arc zero except those on the principal diagonal, 
is called a diagonal matrix. 

If, in addition, all the diagonal elements are unity, the matrix is re- 
ferred to as the identity or unit matrix. It is designated in this book by 
the special letter ^ and has the appearance 


GU = 


1 0 0 • •• 0 

0 1 0 • • 0 

0 0 1 . • 0 


0 0 0 1 


[47] 


If the matrix of the transformation 1 is %, the equations read 

»'l = V| ] 

*2 = >’2 I 


— Jn j 


[48] 


This set is referred to as the identity transformation, since the old 
variables Xi • • • Xn and the new variables yi • ■ • yn are identical. Any 
matrix either pre- or postmultiplied by ^ is equal to itself. 

The unit matrix multiplied by a factor k becomes 


k 

0 

0 • • 

• 0 

0 

k 

0 • • 

• 0 

0 

0 

k ■ ■ 

• 0 

0 

0 

0 • • 

• k 


DC = 


[49] 
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This is called a scalar matrix because any matrix either pre- or post- 
multiplied by 3C is simply multiplied by the scalar factor k. 

When a matrix is multiplied by the diagonal matrix £2), Eq. 46, the 
results for pre- and postmultiplication differ in the following manner. 
The formation of 3) X Q has the effect of multiplying the rows of fl by 
dll, ^ 22 , ’ ' • dnn respectively, whereas in the formation of Q X 9) the 
columns of 0. are so multiplied. 

A square matrix d whose elements satisfy the conditions 

(^ki = (^ik [50] 

evidently possesses symmetry about its principal diagonal. It is, there- 
fore, called a symmetrical matrix. The elements on its principal diagonal 
are, of course, arbitrary. 

On the other hand, if the elements are subjected to the conditions 

(iki = —dik [51] 

those on the principal diagonal must be zero. In this case the matrix is 
said to be skric symmetric. 

Interchanging rows and columns has no effect upon a symmetrical 
matrix, but in the case of a skew s^Tiimetrical matrix it amounts to mul- 
tiplying by the factor —1. It is, therefore, seen that if both d and ® 
are s>Tnmetrical or both are skew sjonmetrical, (2 X £B = £B X G; that 
is, the matrix product is commutative in these special cases. 

6. Inverse, .\djoint, transposed, reciprocal, and orthogonal 

MATRICES 

The rank of a square matrix of order n is equal to that of its determi- 
nant. When the rank is ;i, the determinant is not zero, and the matrix 
is said to be nonsingiilar. The matrix is singular if its rank is less than 
that is, if its determinant is zero. 

When the matrix fl of the transforniation 1 is nonsingular the trans- 
formation may, according to the discussion of Arts. 8 and 9 of the previous 
chapter, be inverted to yield 

b\\y\ + h2y2 + • • • + bmyn = Xi 
b2\y\ + b22y2 + h binyn = X2 


bniyi + bn 2 y 2 H h inn^n == Xn 

The matrix of this inverse transformation 



~bii 

b\2 * ' 

• • h- 


ffi = 

621 

622 * ' 

' • ^ 2 n 

[53] 


J>nl 

^«2 * 

bfin^ 
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is referred to as the inverse of G. This is written 

= ffl [54] 

The matrix G evidently does not possess an inverse when it is singular. 
It isTHear that all matrices are singular in which the number of rows is 
not equal to the number of columns, because the system of equations 
^represented by such matrices does not have unique solutions. A square 
matrix does not possess an inverse if its rank is less than its order. 

"" A few additional remarks are in order regarding the inversion of a 
linear transformation as viewed from the corresponding matrix equations. 
Writing the matrix equation for the transformation 1 

Gx] = y] [55] 

one is tempted to solve this for the matrix x] by simply dividing the 
equation through by G. This form of solution is, however, meaningle.ss 
without an interpretation of the operation of division by a matrix. 

Such an interpretation is arrived at if one observes first that the ele- 
ments of the inverse matrix fB, Eq. 53. are given by Eq. 48, Ch. I, which 
reads 

b„ = [56] 

in which A^r are the cofactors of the determinant A corresponding to 
the matrix G. The result expressed by Eq. 55, Ch. I, coupled with a 
recognition of the similarity between the processes of determinant multi- 
plication and matrix multiplication then shows that 

G X G"' = G"* X G = [57] 

that is,\^the product of a nonsingular matrix with its inverse equals the 
unit matrix, Eq. 47i Hence it becomes clear that the matrix equation 
55Ts~soIved by premultiplying on both sides by G“', which gives 

G{ix] = x] = G“'y] [58] 

The operation of division by a matrix is, therefore, interpreted in terms 
of multiplication by means of the inverse matrix. 

Division by a matrix is subject to several restrictions. First of all, the 
matrix by which a matrix equation is to be divided must be nonsingular. 
Furthermore, in a matrix equation of the form 

G X 3 X C = 9 [59] 

it is not possible in a single step to divide out 3 so as to get a solution 
for G X C. 

To obtain this result, a more lengthy procedure such as the following 
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must be used. The equation may first be postmultiplied by giving 

G X S = 8 X [60] 

Next it is postmultiplied by 9r^. It then reads 

a = 8 X e-i X [61] 

Finally it is postmultiplied by C, giving the desired result 

a X C = 8 X X X C [62] 

The same result may alternatively be obtained by first premultiplying 
Eq. 59 by a”** so that it becomes 

fB X C = a-' X 8 [63] 

then premultiplying by SB”' to obtain 

e = X a-' X 8 [64] 

and finally premultiplying by a so as to have 

a X c = a X ffi-' X a-' X 8 [65] 

It should be observ-ed that obtaining the desired result in this case 
requires not only that 66 be nonsingular, but also that either G or C be 
nonsingular. If this additional condition is not met, the desired result 
cannot be obtained. 

Because of these restrictions on the process of division by a matrix, 
it is best not to use the term division at all. In matrix algebra pre- 
or postmultiplication by an inverse matrix is possible (if the given matrix 
is nonsingular), but the operation of diNision is said in general not to 
exist. 

By means of Eq. 56 the inverse of G may be written in the form 



The process of finding this inverse may be described as follows. First, 
each element in G is replaced by the quotient of its corresponding co- 
factor and the determinant of G. Second, the rows and columns in this 
resulting matrix are interchanged. This procedure evidently involves 
a large amount of labor when the order of the given matrix is high. 
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There are other methods by which the inverse of a matrix may be found. 
One of these, which is usually considerably shorter than that stated here, 
is described in the next article. Additional methods are discussed in 
Art. 11. 

When the given matrix is of the diagonal form, Eq. 46, its inverse is 
also of the diagonal form and is given by 





0 

d22 ^ 


0 • •• 0 
0 • • • 0 


p 0 • • 


[67] 


This result follows from the fact that all cofactors except those for the 
diagonal elements in £Z) are zero, and these cofactors arc 

Dkk = ^11^22 * * ' dk—i,k—ldk+l.k-\-l * * * dnn [68] 

while the determinant of 2) is 

D = d\\d22 * dnn. [69] 


Hence 


Dkk , _i 
= dkk 


[70] 


When A ^ is factored out of the inverse matrix (? ^ Eq. 66, there is 
left 


AGr^ = 



[71] 


This matrix is called the adjoint of Q. Since the adjoint contains only 
the cofactors of the elements of G, it may exist even when G is singular, 
but only if the rank of G is not less than (n — 1 ) because all the cofactors 
would otherwise be zero. 

According to Eq. 55, Ch. I, the determinant of G”^ is A'~^\ that is, 
the inverse matrix has a determinant which is the inverse of that of the 
given matrix. It is clear from Eq. 71 that the adjoint matrix Cf" of order 
n has a determinant which is A times that of G”^ Hence the determinant 


of the adjoint is * 

; that is, 





All A 21 • 

A „i 




Ai2 A 22 * 

* * An2 

= i4"~^ 

[72] 


^In A2n ' 

’ * A 




It follows that if the determinant of G is not zero, the determinant of the 
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adjoint is not zero either, whereas if the determinant of G is zero, that of 
is zero also. 

Hence if d has the rank n, then Q® also has the rank n\ but if Q has 
any rank less than (» — 1), the rank of G“ is zero because all its elements 
are zero. When the rank of Q is (» — 1), at least one of the elements of 
S'* is not zero and, therefore, its rank is at least 1. It can be shown to be 
no greater than I, as is further discussed in Art. 8 of Ch. III. 

The linear transformation 1 is for some purposes more effectively 
studied when written in the schematic form 



Xi 

^2 

X3 • 

■ * 

yi 

ail 

ai2 

^13 • 

‘ * Oin 

>'2 

a2i 

022 

023 • 

■ • 02n 

>’3 

^31 

O 32 

O33 • 

a 

CO 

i 


On2 

On3 * 

• • Ann 


which places the matrix more clearly in e\idence. The Eqs. 1 are easily 
read out of this schematic arrangement by following along the rows and 
mentally dropping the .v’s down into their proper positions, supplying 
at the same time the necessar>' equal and plus signs. It now becomes 
more evident that these equations are closely related to another set, 
namely that which is obtained when the same manner of reading is 
carried out along the columns. Written out in the normal fashion, these 
equations are 

+ ^2i 3’2 + • • • + ®niyn = 

«12>i + <^22^2 + b = ^2 


^Inyi + ^2ny2 + •••-- 



With reference to Eqs. 1, they are called the transposed set. Their 
matrix is 



Oil 

<hi ’ ' 

• Onl 

Q,= 

012 

022 • • 

* On2 


_Oin 

Oon • * 

* 


[75] 


and is called the transpose (sometimes also the conjugate) of fl. The 
transpose of Q is the matrix G with its rows and columns interchanged. 

In this book the transposed matrix is designated by the subscript t 
and the adjoint by the superscript a, as in Eqs. 75 and 71. 

The transpose of a matrix evidently exists whether that matrix is 
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singular or not and also when the number of rows is different from the 
number of columns. In particular, the transpose of a row matrix is a 
column matrix, and vice versa; that is. 


and 



[76] 

*]< = ^ 

[77] 


The transposed set of equations 74 should not be confused with the 
inverse set 52. For certain types of matrices the transpose and inverse 
(are the same, but not in general. 

In order for the inverse of a matrix to be equal to its transpose, the 
elements of that matrix must satisfy certain special conditions. It is easy 
tcTestablish these conditions. If the elements of the matrix Q are 
those of the transpose Qt are a,r, and the elements of the inverse matrix 
are given by Eq. 56. Hence the desired conditions are expressed by 

a.r = [78] 


The significance of this result may be made more evident by substitut- 
ing it into the relations expressed by Eqs. 45 and 46 of Ch. I. These 
equations, it is recalled, hold generally for the elements and cofactors 
of any determinant. Substituting Eq. 78 into Eq. 45 of Ch. I gives 




f 1 lor i = k 
[0 ioT i 9^ k 


[79] 


Substituting Eq. 78 into Eq. 46 of Ch. I yields the companion relation 


(^il^kl + (^i2^k2 4“ * * ' “I” O^in^^kn 


1 for i = k 
0 for i k 


[80] 


Equation 79 states that the sum of the squares of the elements of any 
column equals unity, but that the sum of the products of the elements 
of any column with the corresponding ones of any other column is equal 
to zero. Equation 80 expresses a similar relation with regard to the ele- 
ments in the rows of the matrix. For example, 


or 

« 21 * + 022 * + • • • + 0 » 2 * = 1 [ 81 ] 


but 


O11O2I + O12O22 + ’ * • + Oln 02 n — 0 
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or 


^12^13 + ^2^3 + • • • + ^712^113 = 0 [82] 

and so forth. 

A matrix whose elements satisfy the relation 78, and hence the relations 
79 and 80 also, is called an orthogonal matrix. The term orthogonal ’’ 
suggests the existence of a right-angle relationship. The geometrical 
interpretation which justifies the name for this particular type of matrix 
is given in the following chapter. A simple numerical illustration of an 
orthogonal matrix is 


Q = 


0.5 0.5 

-0.707 0.707 
-0.5 -0.5 


0.707“ 

0 

0.707 


[83] 


The determinant of the matrix Q is A . The determinant of its transpose 
is also A , but since the transpose of an orthogonal matrix is equal to its 
inverse, the determinant of Q must at the same time be equal to A~^. 
Hence the determinant of an orthogonal matrix must satisfy the equation 



II 

[84] 

whence 



[85] 

and 


4 = ±1 

[86] 


The conditions under which the algebraic sign is either plus or minus 
are discussed subsequently. For the matrix 83, the determinant is +1, 
as the reader may readily verify. 

The transpose of the inverse of a matrix plays a sufficiently important 
role in the subjects utilizing matrix algebra to justify for it a special 
name and designation. It is called the reciprocal matrix, and it is desig- : 
nated by an asterisk affixed to the corresponding script letter. Thus the^ 
reciprocal of Q is given by 

a* = ar' [s?] 


The elements of the reciprocal matrix are similarly designated by an 
asterisk placed on the elements ar« of G. Reference to Eq. 66 shows that 
the elements of Q* are given by 


a*f, = 



[ 88 ] 
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The same result is obtained by calculation of the inverse of the trans- 
pose, so transposition and inversion are commutative. 

The following summary* may be helpful at this point : 

For a given matrix (? with elements determinant and co- 
factors i4r*: 

The inverse of Q is with elements and determinant 

A 

The adjoint of (2 is fl® with elements A^r and determinant 
The transpose of Q is (2/ with elements a,r and determinant A, 

The reciprocal of G is G* with elements ii*rs = and determinant 

A^K 

If G is orthogonal, then G/ = G“\ drs = — - > and determinant A = zb 1. 

Also G* = G; that is, an orthogonal matrix is its own reciprocal. 

In the manipulation of matrix equations it is sometimes useful to note 
that 

(G X = % X Qt [89] 

that is, the transpose of a product is equal to the mersed product of the 
individually transposed matrices. This conclusion follows from the fact 
that the product (G X is formed by multiplying the rows of G by 
the columns of SB and then interchanging the rows and columns in the 
resulting matrix, which is the same as though the elements had been 
formed by multiplying the columns of SB (these are the rows of SB^) by 
the rows of G (these are the columns of Qt). For example, the (2,1)- 
element in the resultant matrix (G x SB)/ is formed from the first row 
in G and the second column in SB; and the (2,1 )-elemcnt in the resultant 
matrix SB/ X Qt is formed from the second row in SB/ (second column in 
SB) and the first column in G/ (first row in G), which are identical com- 
binations. 

This relationship may readily be extended to multiple products. If 
G X SB = 9 [90] 

then 

(G X SB)/ = 9/ = SB/ X G, [91] 

Next if 

Q X SB X C = 9 X e [92] 

then 

(G X SB X C)/ = (9 X C)/ = C/ X 9/ = C/ X SB/ X G/ [93] 
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In general, 

((? X a X e X • • • X JP)e = X • • • X X S/ X [94] 

The transpose of a multiple product is equal to the reversed product of 
the individually transposed matrices. 

A similar relationship also holds for the inverse of a matrix product. If 



a X ® = e 

[95] 

then from the fact that 



C X C ‘ % (unit matrix) 

[96] 

and 


a X (B X X a~‘ = % 

[97] 

it is seen that 


e-‘ = ffi-*a-i 

[98] 

Hence 


(a X (B)“' = X a~‘ 

[99] 


The inverse of a product is equal to the reversed product of the individual 
inverse matrices. 

The e.xtension to multiple products follows as before, 

(Q X a X c X • • X X • ‘ • X X X [100] 

It is clear that all the matrices must be nonsingular in order for this 
relation to apply, but this fact does not mean that a multiple product 
involving singular matrices cannot in some cases have an inverse. For 
e.xample, the comj)onent matrices in the following product are evidently 
singular because they are not square: 

[U 24 ] X [i>42] = [^ 22 ] [fOl] 

But the resultant matrix is square and may be nonsingular, in which 
case it possesses an inverse. The relation 99 is not applicable to the 
product in Eq. 101. 

Similar reasoning shows that an analogous relationship holds also for 
the adjoint of a product of two matrices 

(Q X ffi)® = X G® [102] 

as well as for the adjoint of a multiple product 

(G X S6 X e X • • • X S')" = S'® X • • • X X ffl® X G« [103] 

According to the definition (Eq. 87) of the reciprocal of a matrix, it 
follows from Eqs. 89 and 99 that 

(G X ffl)* = G* X [104] 
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The reciprocal of the product of two matrices is equal to the product 
reversed) of the individual reciprocals. The extension of this rule 
to multiple products evidently reads 

(Q X ffi X e X • • • X iP)* = Q* X X C* X • • • X JP* [105] 

The relations expressed by Eqs. 89 and 99 also show that if 9 and 2 are 
orthogonal matrices, so that 

9i = 9^^ and 2^ = 2“^ [106] 

then 

{9 X 2), = (9 X 2)“* [107] 

Hence /fe product of two orthogonal matrices is again orthogonal. 


7. Submatrices or the partitioning of matrices 


In carrying out a matrix product, subdividing or partitioning the 
matrices into smaller components is sometimes convenient. In the 
following example 

<7il ax2 I ^13 

^21 ^122 I ^3 

exffl= ---- 

<^31 ^32 I <^33 

a^l 042 I ^43, 

the so-called submatrices in Q are 



and in SB they are 



= [^3i] ^22 = [^32 ^33] 



The matrix product, Eq. 108, may be written 


axa = [“" H 

L ®21 “ 22 J LP 21 P 22 J 


and evaluated as though the submatrices were ordinaiy elements except 
that the order in which they enter into the products formed by this 
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evaluation process must be observed carefully. Such products of sub- 
matrices must, of course, subsequently be carried out by the same rule 
of matrix multiplication. 

When the original matrices are partitioned as is done in Eq. 108, 
the division of the columns of Q into subgroups must be identic^ with 
the division of the rows of £B into subgroups. Thus the subgroup of the 
first r columns of Ct must be matched by a subgroup of the first r rows of 
£8. This matching assures that all the submatrices appearing in the 
products formed during the subsequent evaluation of Eq. Ill will be 
conformable. 

If the total number of columns in Q are subdivided into more than 
two subgroups, the rows of £B must be similarly subdivided. The rows of 
Q and the columns of £B, on the other hand, may be subdivided in any 
desired manner without affecting the conformability of submatrices later 
appearing in the evaluation process. In Eq. 108 the columns of the matrix 
£B may, for example, not be subdivided at all or they may be divided 
into three subgroups instead of two. 

It is an instructive exercise to show, by means of the relations 37, 
that the total operations of multiplication and of addition involved in 
the complete evaluation of a matrix product are independent of whether 
or how the component matrices are subdivided. This fact alone might be 
regarded as an indication that the present discussion is, for practical 
purposes, wholly irrelevant except that subdivision may be convenient 
in certain specid cases. A somewhat different application of the principle 
of subdividing matrices is, however, very useful practically in the solution 
of a set of linear equations or in the inversion of a matrix. 

The subdivided matrix equation for a linear transformation is indicated 
by 

Oil (li2 I Ui3 • • • din 

d2l d22 1 ^23 * • * fhr 


^31 ^32 I ^3 • • • 


d/ll 0rn2 1 ^n3 ’ ’ * ^nn 



Denoting the submatrices by 



«12 


fdis 

L^3 
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one may write the transformation, Eq. 112, as 

anf. + a,-..?. = ,, j ^j,5^ 

«2K1 + a22f2 = ^72 J 

A possible method of solution is the following. The second equation in 
the set 115 may be premultiplied by a 22 ~^ and solved for ^ 2 ? 

^2 ~ 0^22 %2 — <^22 ^^ 21^1 

Substituting this into the first equation of set 115 gives 

(ail “ «12«22~*«2 i){i = — ai2«22~S2 [1^7] 

from which 

Jl = (an — ai2a22“*a2i)~*(’7i ““ ai2a22'"*n2) [118] 

In a similar fashion one obtains the solution for f 2 » 

{2 = (0^22 ”” «21«I1 '«12) *(^2 ““ «21<3fll~*’7l) [11^] 

As is to be expected, this is simply Eq. 118 with the subscripts 1 and 2 
interchanged. 

Writing for brevity 

=(«!!““ «12«22~*«2l) 

-1 X t ^ 20 ] 

S 2 — (a22 — «2l«ll «12) 

the inverse of the matrix Q in the transformation 112 is indicated by 

r ' I “"^1 *«12«22 ^1 


a-* = 


— ^2 * I 


The inversion of a matrix of given order is thus reduced to the inversion 
of matrices of lower order. These are the submatrices an and a 22 , and the 
resultant matrices 61 and 62 given by Eqs. 120. For an and a 22 to be 
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nonsingular, they must have the same number of rows as columns. 
The matrix (3 must, therefore, be subdivided to meet this condition, as 
is done in Eq. 112 . Even then it may happen that an or 022 is singular, 
but this situation can always be remedied by a rearrangement in the order 
of the original equations. 

The submatriccs ai2 and a2i do not need to be inverted. They may 
have any number of rows irrespective of the number of columns. The 
matrices $i and 62 are evidently square and of the same order as an 
and a22 respectively. The process is, therefore, always applicable pro- 
vided, of course, that Q is nonsingular. 

When the matrix (i has many rows and columns, an extension of the 
method may be developed which allows for further subdivision, or the 
present method may again be applied to some of the submatrices obtained 
from a preliminary subdivision. 

A numerical <.*xample may serve to illustrate the advantage of this 
method over the one suggested by Cramer’s rule. Let the matrix be 


■ 2 

3 ; 

-4 

r 

1 

-2 1 

2 

4 

3 

2 ! 

-2 

3 

_-2 

1 ! 

3 

- 1 . 


f"or the indicated subdivision. 





■-4 r 

an = 

L 1 -2j 

«12 = 

2 4_ 


■ 3 2' 


r“ 

CN 

1 

«21 = 

L-2 IJ 

a22 = 1 

L 3 -ij 


[ 122 ] 


[ 123 ] 


In this simple case the submatrices au and 0:22 may be inverted by in- 
spection, gi\ing 



[ 124 ] 


Next in order is the determination of 

_ ir-4 n fi 3-|_ir-i -lo] 

ai2a22 2 4j^L3 2J tL 14 I4J 

and 


a21«ll 


[ 125 ] 


[ 126 ] 
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Then 

« 12«22 *®21 

tt21«ll 
whence 


ir-1 -101 r 3 21 iri? -121 

- 7 Li 4 I4JXU2 .J-iU 42J ™ 


and 


Finally, 


and 



1 r 8 

51 

r 

-4 1 

1 1 

r- 

22 

281 

2 — 

71-3 

-sj 


2 4 

J = 7 

L- 

-4 

-3sJ 

r 

2 3 

1 M 

“17 

-121 

_ir 

-3 


331 

= L 

1 -2 

J~7| 

.14 

42 J 

71 

-7 

- 

56 J 



331- 


' F" 

56 — 

331 




L-7 

-56j 


57 L 

7 - 

-d 



_r 

__2 


T- 

22 

281 _ 

1 ■ 

■ 8 

"d 


3 - 

-ij- 

71 - 

-4 - 

- 35 J “ 

7 . 

.25 

28j 


[8 - 


1 

r 28 






L 25 

28j 

~ 57 

L-25 

sj 




1 

r 56 

•’•n X 

T" 

-1 - 

>01 _ 

1 

rs8 

-141 

“57 

1-7 


71 

14 

id 

57 

L 7 

loj 


[130] 


[132] 


r 28 ■ 

nxiF"' "d=- 

■-29 

-12‘ 

L-25 1 

d 71 3 8j 57 

. 32 

27. 


When Eqs. 130, 132, 133, and 134 are put together according to the form 
indicated in Eq. 121, the desired inverse is found to be 


a-' = 



*-56 

-33 

58 

-14* 

1 

7 

-3 

7 

16 

57 

-29 

-12 

28 

7 


32 

27 

-25 

8 


[135] 


The student should obtain this same result by means of Cramer’s rule 
directly in order to appreciate how considerable a saving in labor is 
afforded by the use of submatrices. 

The matrix of Eqs. 112 may be manipulated in a variety of additional 
ways which yield slight modifications in the process of obtaining the same 
end result, but there is little point in further discussing this item here 
since the fundamental principle remains the same. 
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When, after subdivision, the given matrix has the form 

®-[o‘ L] 

Eqs. 120 and 121 show that the inverse is given by 

L-'] 

On the other hand, if 

«-[l, o'] 

the inversion of Eq. 115 shows that 

o”"] 

These simple examples are readily generalized, with the result that if 



«11 

0 ■ 

• • 0 ■ 

Q = 

0 

0.22 ‘ 

• 0 


_0 

0 • 



then 


e-‘ = 

ail * 
0 

0 

“ 22 “' • 

• • 0 
• 0 


_o 

0 




"0 • 

• • 0 


a = 

0 • 

• • a-2. —1 

0 


_aHi * 

•• 0 

0 


then the inverse is given by 



'o 

... 0 

a«i * 

= 

0 

• • • «~'n-1.2 

0 



... 0 

0 


[141] 


[142] 


[143] 


8. The linear transformation of matrices 

In Art. 1 it is shown that if the variables yi • • • yn in the transformation 
1 are subjected to the further transformation 1 1, the matrix which com- 
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bines the two transformations into one is given by the matrix product 
expressed by Eq. 20. In this process the matrix Q is said to be linearly 
transformed into the matrix C. 

The matrix by means of which the transformation of Q is effected, 
may sometimes advantageously be thought of as the resultant of a 
multiple product of very simple component matrices; thus 

X • • • X X ^^2 X % [144] 

The transformation of Q is then considered to be accomplished through 
the succession of a number of simpler transformations, the first of which 
is effected by the second by and so forth. 

A matrix may be transformed by f)ost multiplication as well as by 
premultiplication, in which case the matrix effecting the transformation is 
decomposed into components according to the order indicated by 

= fBi X fBo X % X • • • X ^B,. [145] 

Here Si accomplishes the first simple transformation. So the second, 
and so forth. 

It is of interest to determine the simplest fundamental forms into 
which an arbitrary nonsingular matrix ‘B may be decomix)sed, and 
to interpret the individual transformations which they effect upon the 
form of a matrix Q. They are called elementary transformations, and the 
matrices which produce them are the elementary transformation matriecs. 
There are three types of elementary transformations, arui, corres|K)nd- 
ingly, there are three fundamental types of so-called elementary trans- 
formation matrices. 

The first t\"pe of elementary transformation amounts to an interchange 
of any two rows or columns of the matrix (i. The second t>q>c is the addi- 
tion of the elements of a row or column to the corresponding elements of 
another row or column; and the third type is the multiplication of any 
row or column of Q by an arbitrary nonzero factor. Each type of trans- 
formation may be effected through multiplying Q by a c()rres|X)nding 
type of transformation matrix. If the desired transformation is intended 
to affect the roivSj Q is /^remultiplied by the transformation matrix; if 
the columns are to be affected, Q is />(?5finultiplied by the transformation 
matrix. 

Each type of elementary transformation matrix is formed from the 
unit matrix Eq. 47, by performing upon it the same elementary 
transformation which the desired transformation matrix is intended to 
effect in the matrix fl by means of pre- or postmultiplication. Thus 
with its pth and gth rows (respectively columns) interchanged yields a 
transformation matrix (type 1) which, by means of pre- (respectively 
post-) multiplication, interchanges the pth and qih rows (respectively 
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columns) of (J. The matrix ^ with its ^th row (respectively colunm) 
added to its pi\i row (respectively column) yields the transformation 
matrix (type 2) which, by means of pre- (respectively post-) multiplica- 
tion, effects the addition of the elements of the ^th row (respectively 
column) of d to the corresponding elements of its ^th row (respectively 
column). Finally, the matrix ^ with its pih. row (respectively column) 
multiplied by a factor k yields a transformation matrix (type 3) which, 
by means of pre- (respectively post-) multiplication, multiplies the 
elements of the pih row (respectively column) of Gt by k. 

Observe that each type of elementary transformation matrix has two 
forms according to whether it is intended to effect a transformation of the 
rows or columns of Cf (by pre- or postmultiplication, respectively). The 
one form in each case is evidently the transpose of the other, and in types 
1 and 3 these two are readily seen to be identical, whereas in type 2 the 
transposed matrix merely interchanges the distinction between the 
designations p and q in the description of the preceding paragraph. 

The elementary transformation matrices are in this text denoted by 
the script letter T with subscripts intended to indicate the type. Thus 
is used to designate t>'pe 1 ; designates type 2 for the trans- 
formation of rows, whereas 3'p^qi is the designation of this type for the 
transformation of columns; and ^pxk denotes the transformation matrix 
of type 3. 

These are illustrated for the case of fourth-order matrices by the 
following examples : 




^ 2-f3 ~ 


'2+31 = 


- 


0 0 1 0 
0 10 0 
10 0 0 
0 0 0 1 ^ 

1 0 0 0 ” 
0 110 
0 0 10 
0 0 0 1 . 

“1 0 0 0 ' 
0 10 0 
0 110 
0 0 0 1 . 

“1 0 0 0 ' 
0 10 0 
0 0 k 0 
0 0 0 1 


= (5^1-3)/ (t>Tel) 


[146] 


= (3’2+3i)« (type 2 for rows) [147] 


= {^ 2 +s)t (type 2 for columns) [148] 


= (3’3x*)< (type 3) 


[149] 
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The transformation of a fourth-order matrix G by means of these is 
illustrated in the numerical examples given below: 



"0 

0 

1 

O' 


2 

4 

3 

6' 


' 4 

2 

6 

9' 


II 

X 

eo 

/ 

0 

1 

0 

0 


1 

5 

8 

7 


1 

5 

8 

7 


1 

0 

0 

0 

X 

4 

2 

6 

9 

— 

2 

4 

3 

6 



_0 

0 

0 

1, 


JO 

5 

3 

1, 


JO 

5 

3 

1_ 



■ 2 

4 

3 

6 

“ 

'0 

0 

1 

O' 


'3 

4 

2 

6' 


Gxyi.3= 

1 

5 

8 

7 


0 

1 

0 

0 


8 

5 

1 

7 


4 

2 

6 

9 

X 

1 

0 

0 

0 


6 

2 

4 

9 



JO 

5 

3 

1 


.0 

0 

0 

1_ 


.3 

5 

10 

1_ 



"1 

0 

0 

O' 


2 

4 

3 

6" 


■ 2 

4 

3 

( 
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3’2+3xa= 

0 

1 

1 

0 

X 

1 

5 

8 

7 


5 

7 

14 

16 


0 

0 

1 

0 

4 

2 

6 

9 


4 

2 

6 

9 



_0 

0 

0 

1. 


JO 

5 

3 

ij 


JO 

5 

3 

1 
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4 

3 

6‘ 

■ 

'1 

0 

0 

O' 


' 2 

7 

3 

6' 


Gx3'24^| = 

1 

4 

5 

2 

8 

6 

7 

9 

X 

0 

0 

1 

1 

0 

1 

0 

0 

= 

1 

4 

13 

8 

8 

6 

7 

9 



JO 

5 

3 

1_ 


_0 

0 

0 

1_ 


JO 

8 

3 

1 



ri 

0 

0 

o' 


■ 2 

4 

3 

6' 


r 2 

4 

3 


6 

1 

^ 3X(5) Xfl = 

0 

0 

1 

0 

0 

5 

0 

0 

X 

1 

4 

5 

2 

8 

6 

7 

9 


' 1 
20 

5 

10 

8 

30 

7 

45 


.0 

0 

0 

1 _ 


10 

5 

3 

1 _ 


JO 

5 

3 


1 

J 

* 

■ 2 

4 

3 

6' 


'1 

0 

0 

O' 


' 2 

4 

15 

6' 


flXT 3X(5) = 

1 

4 

5 

2 

8 

6 

7 

9 

X 

0 

0 

1 

0 

0 

5 

0 

0 

= 

1 

4 

5 

2 

40 

30 

7 
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10 

5 

3 

1 


0 

0 

0 

1 


10 

5 

IS 

1 



[150] 

[ 151 ] 

[ 152 ] 

[ 153 ] 

[ 154 ] 

[ 155 ] 


The transformation matrix of type 2 may be generalized from the 
form to 7p±q (with reference to either rows or columns), which 
allows the gth row or column to be subtracted from as well as added to the 
/>th row or column. It is useful to note that for this type 



— ^q±p\ 

[156] 

and since 

“ ^P±9\ 

[157] 

it follows that 

(^P±<l)i ~ ^Q±P 

[158] 

and 

(^P±q\)t = ^q±p\ 

[159] 
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For the matrices of types 1 and 3 it is clear that 


(Tp^q)^ T p^q 

[160] 

and 


{^PXk)t = T p^k 

[161] 


It is also useful to note the form of the inverse of each type of trans- 
formation matrix, thus 


T -1 - T 

^ p^q — p^q 

[162] 

= ^PTq (row or column) 

[163] 

^PXk ^ ^ PX4~* 

[164] 


According to Eqs. 160 and 162, the transpose of the matrix of type 1 
equals its inverse. This type of transformation matrix is therefore 
orthogonal. 

In the transformation of matrices it is useful to have a type of trans- 
formation matrix which effects the addition (or subtraction) of the 
/r-multiplied ^h row or column to (or from) the pth. row or column in a 
single operation. Such a matrix is e\idently the following combination 
of types 2 and 3: 


^P±kQ — ^qXk^^ X ^p±q X ^ qXk 

[165] 

^P±kq\ “ ^qXk X ^p±q\ X ^qXk~^ 

[166] 


according to whether the rows are to be affected by premultiplication or 
the columns by postmultiplication, respectively. The matrix 7p±kq or 
^p±kq\ is formed from the unit matrix % by adding (or subtracting) its 
jfe-multiplied ^h row or column to (or from) the p\h row or column, 
respectively. It may be referred to as type 23. It has the properties 

= ^P±kq\ [167] 

^p±i‘q~^ - O'" column) [168] 


A numerical illustration of the use of this matrix is 
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Q XlT 2+(4)3| = 


■ 2 4 3 6 
15 8 7 
4 2 6 9 
10 5 3 1 


'1 0 0 O' 
0 10 0 
0 4 10 
0 0 0 1 
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1 
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10 

17 
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[169] 


[170] 
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According to the theory of determinants, it is clear that the trans- 
formation matrices have determinants with the following values: 


(U-pe 1) 

Tp., = -1 


[171] 

(tJTJc 2) 

^p±q == 1 

(row or column) 

[172] 

(t>-pe 3) 

Tpxk = k 


[173] 

(type 23) 

Tp±kq = 1 

(row or column) 

[174] 


Hence transformation of the matrix Cf by means of types 2 or leaves 
the value of the determinant A unchanged; transformation by means of 
type 1 merely changes the algebraic sign of -1 ; and transformation by 
means of type 3 has the effect of multiplying the determinant A by the 
factor k. 

It should be obser\cd that an elementaiy^ transformation matrix .T 
is always square (has the same number of rows as columns), but Cf need 
not be. However, the order of fT must be so chosen that the matrices are 
conformable. If Q is not square, the correct order of CT is different in the 
cases of pre- and jx)stmultiplication. 

The three t>'pes of elementaiy^ transformation with the matrices 
and ^Spxk^ if repeatedly applied to an arbitrary matrix Cf. 
are capable of yielding the same end result as is expressed by the relation 

f? X Cf X 2 = C [175] 

in which and 2 are any nonsingular matrices. This is the .same as saying 
that any nonsingular matrix such as or 2 can be represented as a 
product of suitably chosen elementar>' matrices of t>Tpes 1, 2, and 3 
alone. 

9. Equivalenxe of matrices 

A matrix C which may be obtained from a matrix Q by means of a 
finite number of elementary’ transformations is said to be equivalent to Q. 
The equivalence of matrices is evidently a mutual relationship, since the 
elementary transformations are nonsingular and hence reversible. Thus 
if 6 can be obtained from Cf by a succession of elementary transformations, 
it follows that Q can be regained by means of elementary transformations. 
The state of equivalence of two matrices Q and C may evidently be ex- 
pressed by Eq. 175, in which (P and 2 are arbitrary nonsingular matrices. 
r It is important for later applications to recognize that equivalent 
^ matrices have the same rank. The truth of this statement may be seen to 
foUow from the fact that the rank of a matrix depends u[X)n projxjrties 
I of the corresponding determinant (the vanishing or nonvanishing of this 
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determinant and its minors), which, according to the theory of determi- 
nants, are not altered by the elementary transformations. 


10. Transformation of a square matrix to the diagonal form 

A square matrix possesses equivalent diagonal forms, any of which 
may be obtained through a linear transformation having the form of 
Eq. 175. Such a transformation may be written 

f? X a X 2 = 2) [176] 

3) being an equivalent diagonal form of 0. 

The matrices fP and S which accomplish the transformation can be 
formed from components of the combination type 23 alone (see Art. 8), 
in which case (i and ii) have the same determinant. 

The detailed process is best shown by means of a numerical example. 
'I'he one given in Art. 2, Ch. I, illustrating the numerical evaluation of a 
determinant, involves precisely the elementary transformations required 
here. The given matrix Ct may be assumed to have the determinant A 
expressed by Eq. 4, C'h. I. The matrix consists of components of the 
type their formation being described in Art. 2, Ch. I, by steps 

1, 2, and 3. Since premultiplication is required by these transformations, 
the order of the comix)nents referring respectively to these steps reads 
from right to left, thus 

’I 0 oi r 1 0 oi r 1 0 oi r 1 o o“ 

0 1 0 X 0 1 0 X -4 1 0 = -4 10 [177] 

_o - 1-^0 d L--^ 0 ij L 0 0 ij L i -i 1. 

Third step Second step First step 

Similarly, the matrix S is given by the product of components of the 
type their formation being described by steps 4, 5, and 6. Here 

lK)stmultiplication is involved. Hence theo’derof the component matrices 
corresjx)nding resiiectively to these step-^ reads from left to right. The 
desired matrix 2. is, therefore, given by 

10 Oi fl -3 

0 1 -^ = 0 1 -i 

0 0 1 J Lo 0 1 

Fourth step Fifth step Sixth step 


"1 -3 Ol ri 0 -2l 

a= 0 10x01 OX 

0 0 1 0 0 ij _ 



The transformation expressed by Eq. 176 reads 


1 0 oi ri 3 21 n -3 

-4 10X426X0 1 

i ij 13 1 7j LO 0 





[179] 


The determinant A is equal to the product of the diagonal elements. 
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It is obvious that, by means of elementary transformations of the type 
^PXky this resultant diagonal matrix may be made to go over into the 
unit matrix of like order. 

It is also possible to transform a matrix G to the diagonal form by pre- 
multiplication or postmultiplication alone. For example, after the matrix 
is reduced to the triangular form in the manner described above, one has 


[IcSO] 


The elements above the principal diagonal in this triangular matrix may 
now be reduced to zero by continuing the process of premultiplication 
with transformation matrices of the same type as the ones contained in 
Eq. 177. Thus the fourth step may be the following; 



1 

0 o' 


’1 3 2' 


'1 3 2' 

£? X G = 

-4 

1 

L 5 

1 0 

_A 1 

5 

X 

4 2 6 

3 1 7 


0 -10 -2 

.0 0 


'1 

0 

“HI 


'1 

3 

2“ 


‘l 

3 

O' 

0 

1 

0 

X 

0 

-10 

-2 

= 

0 

-10 

-2 

0 

0 

1 . 


0 

0 



0 

0 

¥J 


The fifth step becomes 


"1 

0 

0 

1 

0 “ 
-10 

1 3 

X 

'1 

0 

3 

-10 

o' 

-2 


'1 

0 

3 

-10 

1 

o o 

0 

0 

1 ^ 


0 

0 



0 

0 

¥J 


[181] 


[ 182 ] 


[ 183 ] 


The resultant transformation matrix combining the last three steps 
becomes 


and the sixth and final step reads 


"1 

1 % 

0 “ 


'1 

3 

0 ■ 

'1 

0 

o ' 

0 

1 

0 

X 

0 

-10 

0 = 

0 

-10 

0 

0 

0 

1 


0 

0 

¥J 

0 

0 

¥J 


a= 


1 0 
0 1 0 

X 

'10 0' 
0 1 H 

X 

1 

O 

0 

1 



.0 0 1. 


0 0 1 


[0 0 ij 



A 

1 

0 


-A 


■^1 
hS 
1 . 


[184] 


Sixth step Fifth step Fourth step 


which is, of course, different from the matrix 178, although not unlike 
this matrix in form. The complete transformation is accomplished by 
premultiplication of Q by the resultant transformation matrix; 


= a X S' = 


'1 


■"A 

' 1 

0 

O' 

0 

1 

n > 

-4 

1 

0 

0 

0 

1 . 

L i 

-i 

1 


H 

A if 
L i -f iJ 


[ 185 ] 
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In place of the transformation expressed by Eq. 176, one now has 

91 X a = a [186] 

It should be dear that a similar procedure, involving postmultiplication 
only, can also be used to accomplish the same result. The reader may 
carry out the detailed steps as an exercise. 

In the above numerical example, the rank of the given matrix is equal 
to its order; that is, the matrix is nonsingular. When the rank of G is less 
than its order, some of the diagonal elements in 9) are found to be zero, 
but the method of transformation given in Eq. 176 is still applicable. 

From the definition of the rank of a square matrix (which is the same as 
the rank of its determinant) it is readily seen that if the rank of a matrix 
G of order w is r = (n — />), then p of the diagonal elements in 9) be- 
come zero. It is then {X)ssible to arrange the remaining elements so that 
9) has the appearance 


frfu 

0 0 

ol 

0 

^22 9 ... 

0 

0 

0 •• 

• drr * * * 

0 

0 

0 •• 

• 0 0 • 

• • 0 

^0 

0 


0_ 


By a further transformation, the following form may be obtained: 


r 1 

0 

0 

o' 

0 

1 

0 

0 

0 

0 

. . . 1 . . . 

0 

0 

0 

• • • 0 • 

• • 0 

0 

0 


0. 


[188] 


Since the matrix 9) is s>Tnmetric, it may be desirable to find S by a 
transformation of the form S = 9^t9)9^. The matrix if will not be real, 
however, if any of the diagonal elements in 9) are negative. The form of 
6 given by Eq. 188 may nevertheless be obtained using only real matrices 
either by pre- or postmultiplication alone, or by a transformation like 
Eq. 176 in which 9 and S need not be specially related. 

This diagonal matrix (in which the first r diagonal elements are unity 
and the rest zero) is referred to as the canonical form of the square matrix 
G of rank r. The diagonal and the canonical forms of a matrix obviously 
place its rank in evidence. 

A square matrix whose rank is less than its order is said to be de- 
generate. If the rank is r = (n — p)j then p is spoken of as the degree of 
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degeneracy of the matrix or also as the nullity. The latter term is evi- 
dently suggested by the forms given by Eqs. 187 and 188. 

When the matrix G is s>TTimetrical, the matrix tP which enters the 
transformation to the diagonal according to the process indicated in 
Eq. 176 can evidently be equal to the transpose of 2,* so that this equa- 
tion may be written 

a, X G X 2 = [189] 


The matrix 2 can have a variety of forms, depending upon the values 
of the diagonal elements appearing in 3^. For certain values, called the 
characteristic values or also the latent roots of the matrix G, the trans- 
formation matrix 2 in Eq. 189 becomes orthogonal. This orthogonal 
transformation is discussed in detail in the immediately following chap- 
ters, where it is also shown that the corresix)nding values of the diagonal 
elements in 3) are the roots of the so-calle<l characteristic cqualion^ 


(rtii —X) ^12 • • • Uin 

^21 (^^22 — 






= 0 


[190] 


The left-hand side of this characteristic equation, which is formed by 
setting the determinant of the matrix G — X^l equal to zero, is evidently 
a pol>Tiomial in X of the ;ith degree. It is clear that the constant term in 
this equation (the term without X) is equal to the determinant of G. 
Moreover, the coefficient of the highest-power term is evidently ( — 1)”. 
If, then, Eq. 190 is divided through by ( — I)'', and if the n roots are 
Xi, X 2 , • • • Xn, it follows from the theory of algebraic equations that 

/I = Xi X X 2 X • • X X„ [191] 


The determinant of 3) is, of course, also given by the product of the roots 
Xi, X 2 , • • • Xn, these being the diagonal elements in 3). This result checks 
with the fact that the determinant of an orthogonal matrix is dbl (see 
Eq. 86), so that the determinant of 3) in Eq. 189 is equal to the determi- 
nant of G. One thus recognizes the invariance of the determinant of a 
matrix to an orthogonal transformation of the form of Eq. 189. 

•It is significant that is not necessarily the transpose of S when G is symmetrical. Thus 
if the 6rst operation upon the rows of G effects an interchange in any two rows, the matrix 
encountered at the next step is no longer symmetrical. Symmetry can, of course, be restored 
by subsequently performing a similar operation upon the corresponding column.^, but such 
a step is not necessary, and if it is not taken the matrices 9 and ultimately obtained will 
obviously not be each other’s transpose. 
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1 1. Additional methods for obtaining the inverse of a matrix 

Having determined the matrices and S which accomplish the 
reduction of a matrix (i to its equivalent diagonal form, according to 
Kq. 176, one may proceed to find the inverse of G by means of the follow- 
ing reasoning. The first step is to form the inverse of both sides of Eq. 
176, making use of the property expressed by Eq. 100. This gives 

a-' X G-' X = Q)-‘ [192] 

Premultiplying by S and postmultiplying by £?, one has 

G~' = a X X 9 [193] 

Inasmuch as the matrices a and 9 are already known, and the determina- 
tion of according to Eq. 67 is a relatively simple task, the formation 
of G“' by means of the result expressed by Eq. 193 involves only a nominal 
amount of additional computation. For the numerical example given in 
the preceding article (sec EcjS. 177, 178, and 179) the Eq. 193 yields 

"1 -3 -a] ri 0 Ol r 1 0 0' 

G-‘ =0 1 X 0 0 X -4 10 

_o 0 1. 0 ^ 1_ 

-4 9.5 -7" 

5 -0.5 -1 [194] 

1 -4 5j 

Alternatively one may begin with Eq. 186 and form its inverse, which 
reads 

G-* X 91"' = [195] 

Postmultiplication by 9. again gives the desired result, namely 

G-i = 3)-i X [196] 

For the same numerical problem as discussed above (see Eq. 185) this 
reads 

1 0 0 1 r“TV “1^3^ 

G-> = 0 0 X A U 

.0 0 aJ L i ^ 

-4 9.5 -7' 

5 -0.5 -1 [197] 

1 -4 5. 

If Q is reduced to the diagonal form by postmultiplication alone, that 
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is, if one has found a matrix S such that 

C? X § = a) [198] 

then a similar manipulation as given above shows that 

(2~‘ = § X [199] 

Another method of matrix inversion, which in its essential steps utilizes 
the reasoning involved in the formation of the matrix f? in Eq. 177, is 
found to be exceptionally effective in numerical work.* The procedure is 
best described by considering the equivalent problem of solving a set of 
n simultaneous equations invoking n unknowns, such as 


flll-Vl -f U12-V2 + • • • + ainXn = yi 
^21*'^1 ^22X2 4" * • * 4“ (i2nXn = y2 


4- dn2X2 4“ * * * 4" OnnXn = yn 


[ 200 ] 


Any other set of n equations which arc linear combinations of these 
(called an equivalent set of equations) clearly has the same solutions. 
Since only the values of the coeflkients a,k and those of the quantities 
yi • • y„ are significant in the formation of such linear combinations, it 
is expedient to carryout the contemplated manipulations upon the matrix 


an 

1 2 * 

* a In 

y\ 


«2l 

^22 ’ 

' a2n 

AS 

[ 201 ] 


an2 ' 

ann 




which is the matrix Q of the set of Eqs. 200 with the quantities yi • • • yn 
included as an additional column. This resultant matrix involving n 
rows and n + \ columns is referred to as the augfnented matrix correspond- 
ing to the Eqs. 200. 

In order to simplify the notation in the following discussion it is 
effective to write the symbols ai.n+i, <3r2,n+i, • • • in place of yi, 

• • • y», so that the augmented matrix assumes the form 


"^11 

Oi2 • 

• dm 


021 

022 ‘ 

* • 02n 

^2.n-fl 

JInl 

On2 • ■ 

• dnn 



The series of manipulations which are described presently, have for their 
objective the derivation of an equivalent set of equations having an 
augmented matrix in which all the elements of the principal diagonal 

miis method is described in a paper by Prescott D. Crout entitied: “ A Short Method for 
Evaluating Determinants and Solving Systems of Linear Equations with Real or Complex 
Coefficients,” A,I.E.E. Transactions ^ Vol. 60, 1941, pp. 1235-1240. 
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are unity and all those below this diagonal are zero. It is readily appreci- 
ated that once this special set of equivalent equations is determined, the 
desired solutions may be written down by inspection. 

The first step is to divide the elements of the first row in the matrix 
202 by an so as to obtain 


1 

^12 

<^13 


an 


On 

a2i 

022 

023 • 

' * ^2.n+l 


an2 

fln3 * 



[ 203 ] 


This step requires that an have a nonzero value. If this condition is not 
fulfilled it can always be met by a rearrangement of the original equations. 
The a 2 i -multiplied elements of the first row are now subtracted from the 
corresponding elements of the second row, giving 


1 

a\2 

Ul3 

^l.n+1 

0,1 

On 

0,1 

0 

^22 

623 * 

' * ^2,tH-1 

^31 

032 

^33 • 

* • ^^3.n+l 


an2 




in which 

^ ^21^12 

022 — ^22 

an 


^23 — ^23 


^21^13 

flu 


[ 205 ] 


y2,n+i — ;; 

<^11 

Next the aai-multiplied elements of the first row are subtracted from the 
corresponding elements of the third row. This leaves the augmented 
matrix in the form 


1 

0^ 

diz 


dn 

dll 

On 

0 

622 

CO 

• • • J2,n+1 

0 

^32 

^33 

• • • 63 .n+l 

.<*1.1 

dn2 

<*n 3 

• • • 


[ 206 ] 
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6e> 

with 

t ^31^12 

O22 — <IZ 2 

an 

_ £2!i!> [207] 

V3,n+1 — ^^3.71+1 


The procedure is continued until the matrix assumes the form 


1 





ail 


0 

622 ^23 * * 

‘ * ^2.71-4:1 

[208] 

0 

^32 ^33 * ' 

’ * ^ 3 .n + l 


_0 

hn2 ^nS * ■ 



A single formula for the coefficients iu 

is recognized to be given 

by 


•» 

II 

1 

agiiJik 

(ill 

[209] 

Now the elements of the second row 

are divided by 622 - (A rearrange- 

ment of the last n — 1 rows will be required if 622 happens to be zero.) 

One then has 




1 

ai 2 ai3 



<*11 <*11 

an 


n 

. ^23 

622 ■ ■ 



U 

t>22 

[ 210 ] 

0 

632 ^33 • • 

• is.n-fl 


.0 

bn2 ^n 3 * 

^n,n+lj 



The portion of this matrix exclusive of the first row and column is now 
dealt with in a manner which is identical with that just described for the 
transformation of the matrix 203 into the form of 208. In the first step 
the ftsj-multiplied elements of the second row are subtracted from the 
corresponding ones of the third row. Next the & 42 -multiplied elements of 
the second row are subtracted from the corresponding elements of the 
fourth row, and so on. This set of operations finally yields the matrix in 
the form 
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1 

^12 

ai3 


^1 ,n+l 

an 

ail 

an 

ail 

0 

1 

^23 

^24 

^2.n+l 



622 

622 

f>22 

0 

0 

^33 

^34 • 


0 

0 

^43 

^44 ' 

■ ' ^4.n-}-l 

_0 

0 

^nZ 

^n4 



[ 211 ] 


in which the coefficients c,k are given by the formula 

[ 212 ] 

022 

The portion of this matrix involving the coefficients c^k is now treated 
in the same manner as described for the original matrix 202, and as 
repeated in the manipulation of that portion of the matrix 208 involving 
the coefficients The resulting matrix then assumes the form 


1 

^12 

<^13 

ai4 




an 

an 

an 


0 

1 

^23 

^24 

^2,n4-l 


^22 

^22 

^22 


0 

0 

1 

£34 

<^3.n-fl 

[213] 




^33 

^33 


0 

0 

0 

^44 

• <^4,n-|-l 


Lo 

0 

0 


• * • 


with 



^«ifc 

C 4^3* 

[214] 





Czz 

The continuation of this process is 

obvious. In the case 

w = 4, for 


example, one arrives at the desired form 

I £12 

an an 

0 1 

®22 

0 0 1 

0 


ai4 

^15 

an 

ail 

624 

^25 

622 

622 

£34 

^35 

Czz 

^33 

1 


d44j 


[ 215 ] 


0 


0 
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The solutions to the equivalent equations having this augmented matrix 
are easily recognized. They are given by 


«44 


C35 

^34 



^*3 = “ 

Xa 




^33 



b2o 

/I04 

^23 


*2 = r~ 
022 

- — X 4 
O 22 

/>22 

^3 

d\o 

^h4 

OlA 

i1 1 0 

Xi = — 

.X 4 

— 

A 3 


(in 




[216] 


-Vo 


In order to systematize the computational procedure it is extH'dienl 
to record the following “ auxiliar>^ matrix 


^^11 


^3 

<Il4 

diF> 

dn 

dn 

dll 

du 

a 21 

h 22 

^23 

622 

^22 

*23 

b22 

d'M 

632 

^^33 

^33 

O35 

^33 

du 

642 

^43 


^45 

d44 


[2i: 


From the recursion formulas 209, 212, 214 which may be rewritten in 
the form 


, ^ 9 \*I\k 

0,k = 

Ctk = dgk 


dgk — dak 


dajdik 

dll 


622 

l> 82 f> 2 k 

622 


CazCzk 

C 22 


[ 218 ] 


it becomes dear that the elements in the aiudliary matrix can be cal- 
culated and recorded in alphabetical sequence, with the recording of the 
elements of any column preceding that of the elements of the correspond- 
ing row. 
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Although only the elements above the principal diagonal are needed 
in the final computation of the unknowns, as is shown by the Eqs. 216, 
it is necessary to calculate and record also the elements on and below the 
principal diagonal since their values are needed in the sequence of com- 
putations which determine the auxiliary matrix as a whole. This fact is 
readily appreciated from an examination of the recursion formulas 218, 
or better still through an application of them to a numerical example. 

Since only those results are recorded which are needed in subsequent 
calculations, the computational scheme is orderly and compact. It is 
also significant to mention that the calculation of any one of the elements 
of the auxiliary matrix, in the sequence described above, may be accom- 
plished through a single continuous operation on a modem computing 
machine. Thus, from the standpoint of minimizing the required opera- 
tions, the present procedure for solving simultaneous equations is excel- 
lently adapted to the available computing facilities. 

Regarding the problem of inverting the matrix of Eqs. 200 it is helpful 
to make the following preliminary observations. Suppose, in this system 
of equations, that all but one of the quantities Vi • • • Vn are zero, and that 
the nonzero one, which may be any y^, has the value unity. The par- 
ticular solution corresponding to this choice of y-values may conveniently 
be denoted by Xu*, ^* 2 ik» * * * ^nJk- Since the index k may have any value 
from 1 to there arc n such sets of particular solutions. It should now be 
recognized that the set of x- values for i = 1, when arranged in a column 
in w'hich the order of numerical subscripts reads from top to bottom, 
represents the first column in the desired inverse matrix; the set for 
k — 1 represents the second column in the inverse matrix, and so forth. 
Thus the complete array of quantities x^u are the elements of the 
inverse matrix, with the customar}” denotation of the indexes 5 and k. 

The process of determining the inverse matrix is thus seen to require 
the evaluation of n sets of simultaneous solutions. This process does not, 
however, require pi times the computatioi al labor involved in obtaining 
a single solution inasmuch as only the last column in the auxiliary matrix 
differs in the procedure for obtaining each solution. The work may be 
arranged so that all the « solutions are evaluated in one continuous 
sequence of operations by recording an auxiliary matrix with n varieties 
of “ last columns written side by side. The details of such a procedure 
are best illustrated by means of a numerical example. 

Choosing the same matrix as is used in the numerical illustrations of the 
preceding article, one begins by considering the following augmented 
matrix 

1 3 2 1 0 0' 

4 2 6 0 1 0 

3 1 7 0 0 1 


[ 219 ] 
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in which the last three columns represent three varieties of the column of 
y-values as given in the matrix 201. The first of these three columns 
represents the values >’i = 1 , 3^2 = 3^3 = 0 ; for the second of the last three 
columns, 3^2 = 3^1 = 3^3 == 0 ; and for the third, 3 ^:^ = 1 , 3^1 == 3^2 = 0 . 

Regarding all the elements of this augmented matrix as coefiicicnts a,k 
in the recursion formulas 218, one obtains the auxiliary matrix 

r \3 2100 

4 -10\3% ^ 0 [220] 

\ 

\ 

^ ft 1 1 _ 4 J> . 

^ ^ '5 \T^ 13 t3J 


Only the element values above and to the right of the elements on the 
principal diagonal (beyond the dotted line) are now used to determine 
the values of the quantities Xgk- The fourth, fifth, and sixth columns are 
regarded as last columns in computing values of for A = 1, 2, 3 
respectively. For A = 1, one finds 




^‘21 ~ 0-4 — 0.2x31 = 


13 


Xu — 1 — 2x31 — 3x21 = — — 


£ 

13 


[ 221 ] 


For k = 2, 


and for — 3, 


*32= -J3 

1 n ^ 0.5 

*22 - - Jq - 0.2X32 = - — 

9.5 

*12 = 0 — 2X32 — 3X22 = 


[ 222 ] 


*23 = 0 — 0.2X33 “ ~ 


[223] 


*13 — 0 — 2X33 — 3X23 = — 


13 



Ch. //) 


PROBLEMS 


71 


Hence the desired inverse matrix is 

a:n x,3 

X 21 *22 *23 

L*31 *32 *33j 


-4 9.5 -T 

5 -0.5 -1 
1 -4 5 


which agrees with the results given by Eqs. 194 and 197. 

PROBLEMS 

1. Write the following as a matrix equation; 


For example. 


e\i\ + C 2 rj + ^3*3 -f e4»4 


51 - 2X5-3X74-6X9+4X2 


2. Write the four scalar equations: 


= V3 


:r4 = >4 


as a single matrix equation in at least four different ways. 
3. Kxprcss as matrix equations the relations: 


dx, dxk 


for s = 1,2,3 and k = 1,2,3. 
4. Evaluate: 


[2 6 ]+[ 1 9] [-2 7] 

and 

fl 6 21 r 2 9 -61 

6 4 3 -5 -4 4 -7 10 

U ij L 2 O 8 -5j 

5. Carry out the following triple product in t-..o ways and note the difference in 
total numbers of multiplications and additions: 

[J -i 1 nil 


6. Evaluate: 


ri 1 n r 7 2 -101 

r 7 2 -loi ri 1 n 

[ 1 i li 


and 
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7 . Given: 


"1 

0 

0 

0 

cos a 

sin a 

0 

—sin a 

cosa_ 


"cos /3 

0 

-sin / 3 ' 

cos 7 

sin 7 

0“ 

6 = 0 

1 

0 

c = -sin 7 

cos 7 

0 

_sin /3 

0 

cos / 3 _ 

0 

0 

1 


Show that these are orthogonal matrices. Also form the products ab^ac, bCy and abcy 
and show that they too are orthogonal. Note that ab ba, ac cu, etc. 

8. In the evaluation of the following multiple product: 



X [4 6 3] X 



x[,^ 


2 ] 


determine that association which requires the least number of multiplications and 
additions. 

9 . Evaluate: 


2-6 4“] [a 0 0“ 

3 9 7 x 0 60 and 

1 5 Sj 10 0 


d 0 0 "] [2 -6 4 “ 

0 6 0 X 3 97 

0 0 rj Li 5 8 


10. Determine the most convenient ways of evaluating the following multiple 
products: 

[1 4]’'[-5 ;]>'[4 

t' ^41 ~M-i 5]^[: -S] 

11 . Evaluate: 

[li]’ c^]’ [\ir [{]]- 

ri 0 01 fi 0 oi 

0 2 0 X 0 0.5 0 

Lo 0 SJ Lo 0 0.2J 


12 . Using the indicated partitioning, find the inverse of the following product : 


"2 6 

1 7 

1 0 0 

1 

1 0 0^ 

v 

“-1 2 10 0" 
3 -4 ; 0 0 

0 0 1 

5 4 

A. 

0 0 1 5 -2 

.0 0 ; 

3 L 


. 0 0 1 7 -6_ 


13 . Given the nonsingular matrix: 



0 

3 

-6 



find the adjoint Q* and evaluate the products Q X G® and Q® X G. Repeat with the 
matrix: 
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Discuss the results, showing why their particular forms should be expected. 

14. Using the matrices G of the preceding problem, determine matrices and S 
which will yield diagonal forms in each of the following applicable relations: 

(i) f? X Q X S = a (ii) fP X Q - 3) (iu) Q X S = 3) 

In terms of these results, find the inverse of the nonsingular matrix of Prob. 13. 

15. Given: 


Q 


2 

-.1 

2 

0 


-I 

1 

4 

7 


find using the metlKxl of partitioning discussed in .Vrt. 7. and again by the 
method involving the “ auxiliary matrix ” as discussed in Art. 1 1 . 

16. Prove directly from the definitions of tru sum and product of two matrices that 

(i X X C' ) = (0 X ffl) X C 

and 

Cf X I ft) + = (<J X ffl) + (Q X C) 

17. Compute the third power of the matrix 


a = 


1-12 1' 
-I 3 0-3 

2 0 9 -6 

I -6 19 


and check the value 13824 of the determinant of the resultant matrix. 

18. If Ci is a nonsingular matrix, and the expres>i()n Cf " is interpreted as being 

equivalent to show that G"" is unique and. in particular, that 

I 0 • • • O' 

g, ^ 0 1 • • • 0 

I 

-0 0 

19. The row matrix [I vj ya • * • y„] may bt icgarded as equivalent to a square 
matrix having these elements in its first row and zeros in the remaining rows. With 
this interpretation of the row matrix, show that 

[1 >'J >'3 • • • y.]" = [1 >'3 >'3 • • • yj 

and more generally that 

Lvi >'s • • • = yi""‘Cyi ys • • • y.] 

Analogously, find for the column matrix that 


>ll 

r yr 

ya 1 

1 ya 

LyJ 

L>'nJ 
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20. The expression 

P{tX) = Cfo -h -f 4- • • • -f CfA'" 

in which (?o» arc matrix coeflicients and f’\' is a matrix playing the role of a 

variable, may be regarded as a matrix ixdynomial. A less general form of matric 
polynomial is 

Q{?X) = Po^l + 4- • • -f 

in which po, pu ‘ ' pn are scalar coetYicients and ^ is a unit matrix having the same 
order as fV. 

(a) Show that the equation 

QiX 4 - - 0 

in which fC, (3, and SR are matrices, has one solution only if G is nonsingular. 

(^b) In the equation 

4- 4- • • • 4- GifV f Go = 0 

let 

?X =[xi .V2 • • • .Vn] 

the coefficients G being matrices. Show that a solution is given by the expression 

n: = - (G,..vi^-» 4-G,.„i.ti"“- 4 4- Go ‘Go 

provided the matrix in the parenthesis is nonsingular. 

(c) Consider the polynomial 

=£1'2 4-<B0C 4.e 

with matrix coefficients, and 

6^ ^ r Xn Xi2“| 

IXii X 22 J 

Show that a solution to the equation 9/ = 0 f'null matrix) exists only if the following 
conditions are fulfilled 

rii^ 4- ^12^:21 4“ 4- ^12^21 = ~Cii 

(.til 4- t22 4- 6 ii)JP 12 4- f>12^22 = — C’l2 

(til 4- t22 4" ^22)^21 4* ^21-*’ll = ~C2l 

t22‘’ 4" ^21^12 4" ^2lX\2 4" f^22^22 = ■~C22 

and that the elements tn, t22, xn are independent only if 

2tii 4- 611 t2i ti2 4- ^12 0 

J X\2 til 4 - X 22 4- ^11 0 ti2 4- f>i2 

^21 + hi 0 Xn 4- X 22 4- hi x^i ^ ® 

^ ^21 4" ^21 ^12 2t22 4" ^22 

21. G and iB are two square matrices of like order whose nroduct is zero, that is 
^ X ® * 0 in which 0 is a null matrix of the same order as G or ffl. Ut Q be given 
and ffi unknown. Write the complete set of equations whose solution (if it exists) 
yields the elements bik in terms of the elements aj*. If Q has the order n and the rank 
«, what is the solution? Dbcuss the possibility and form of solutions if Q has the 
rank n — 1 , and more generally if G has the rank n ^ p. 
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Solve the specific equations 

"I 5 n ^ 

3 15 0xSS-0 and £Bx3 15 0 =0 
.4 20 1 J [4 20 1_ 

22. Given the equation 

-h 39C - 10^ = 0 

in which fC is a square matrix and % is the unit matrix of like order. Find solutions 
of the form in which .v^ is a scalar and show that one may write the matrix 
polynomial in factorecl form 

(fT - .vi%) X - X2^i) = ^^2 4- 3a: - 10^ 

Check this result for the particular value 

[1 i] 

23. If C is a matrix with complex elements -H and if C has the 

conjugate elements, show that although C -h C is real and (? — C is imaginary, it 
does not follow in general that C X t’ is real nor that C X C = C X C. 
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Linear 'J'ranstormations 


1 . \’i:CT()R SKIS 

File primary object of this chapter is to otTer a means of visualization 
for the purely alirebraic reasoninir iinderlyinir the essential ideas presented 
in Chs. I and H. I'his object is accomj>lished by giving a geometrical 
interpretation to the linear transformation and the matrix which charac- 
leri/es it. I'here are» of course, a variety of fxissible forms which such a 
geometrical interpretation may take. Ihe j>oint of view given here is 
chosen for its utility in the particular a}>plications discu.ssed in the 
reference xolume^^. 

In order to keep the discussion fairly general initially, the matrix is 
assumed to have m rows and n columns instead of being square. It is 
written 


Q - 


[ij 


If the elements of each row are c'<>nsi<iered to be the com|X)nents of a 
space vector with reference to a chosen rec tangular Cartesian co-ordinate 
system, the matrix a.ssumes an easily visualized geometrical characteriza- 
tion. It represents a set of vectors ail emanating from a common origin so 
as to form a cluster. Since the number of components of each vector is in 
general greater than three, the process of visualization lacks the physical 
clarity which ordinarily accompanies the conception of a space \’e('tor. 
A purely mathematical extension of the idea of ordinary space to the 
conception of a space of many dimensions involves, however, only a pa.ss- 
ing mental hazard. 

The co-ordinate axes in this many-dimensional space are for conven- 
ience numbered 1, 2, 3, • • • etc., instead of being lettered x, y, as in the 
usual three-dimensional case. Thus the elements (7n, • • • (/i« of the 

first row of fl, for example, arc looked upon as the comfK)nents (projec- 
tions) of a vector ai with reference to the rectangular co-ordinate Jixes 
1, 2, • * • w in an 7^'dimensional space. In like manner, the elements 

' « 2 n are looked upon as the components of a vector jo- Finally, 
the elements of the wth row define a vector The cluster of vectors 
ai, (72, • • • dm is spoken of as the veehr set of fl. 

It is possible for the matrix fl to be characterized by an alternative 
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vector set, namely the n vectors a/, a2^ • • • whose components are 
the elements tiiken by columns instead of by rows. In this case, the space 
occupied by the vectors is w-dimensional, because each vector now ha,s m 
components. I’his cluster of vectors is called the transposed vector set of 
Cf, since it is the vector set of the transix)se of fl. 

2. Linear dependence and independence ; tiie rank of a vector 

SKT 

If one or more of the vectors in a set can be expressed as a linear vector 
addition of the remaining vectors multiplied by suitable positive or 
negative numerical fac tors, the set is said to be linearly deprndcjiL If such 
an expression is not iK)ssible, the vector^ are linearly independent. For a 
set of m vectors the |K)ssible exi.stence of either condition may be ex- 
pressed mathematically by the statement that if one can not find m 
numbers 7i; 72) ’ • * “/m (excluding the choice 7^ = 73 = ■ • • = 7„i = 0) 
for which the following \ ector ecjuation holds 

ynh + 72<l2 + • • • + 7mt^m = 0 [2] * 

the vectors form an inde|)endent (otherwise a dependent) set. 

In three-dimensional s{)ace, any three vectors wdiich do not lie in a 
plane form an indejumdent set. If the three vectors do lie in a plane, one 
or more of them evidently can be expressed as a vector sum of the others 
multiplied by suitable factors. If in this same space the number of vectors 
is greater than three, the set must necessarily be dependent. In order for 
the vectors to form an independent set, it is necessary that their number 
shall not exceed three, although this condition alone is clearly not 
sufficient. 

In an ;/-<limensional space, the largest nirnber of independent vectors 
is n, but a dependent set may contain any iumber of vectors. The num- 
ber of available dimensions in such a space is n. The vectors occupying 
this space may have such relative orientations that fewer than n dimen- 
sions are actually consumed or utilized. In three-dimensional space, for 
example, a set of vectors which lie in a plane consumes only two dimen- 
sions, and if they lie in a line they utilize only one of the available three 
dimensions. 

It is clear that if the number of dimensions utilized is less than the 
number of vectors in the set, the vectors are linearly dependent; but if the 
number of utilized dimensions equals the number of vectors (it obxdously 
cannot be greater than the number of vectors in the set), the vectors are 
linearly independent. If the number of vectors is equal to or less than the 
number of available dimensions, both cases can occur; but if the number 
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of vectors is larger than the number of available dimensions, the only 
possibility is for the vector set to be a dcjxMident one. 

The number of dimensions actually consumed l)y a vector set is by 
definition equal to the nink of that set. The vector set of Cf, Lq. 1, con- 
sists of vectors in an //-dimensional space. 'I'he rank of this set can at 
most be equal to ;/. If /// > //, the vector set of Cf must be dejxndent, for 
the rank is then necessarily less than the number of vectors. In order to 
determine the rank, one may begin by considering consecutively all 
groups of // vectors which can be selected from the given set (the number 
of such groups equals the number of combinations of m things taken n at a 
time) and examining them to determine whether a linear relation of the 
form given by Eq. 2 can be fouiul to exist. If one or more groups can be 
found for which such a relation does not exist, the rank is //; if it exists for 
all groups, the rank is less than /;, and one must proceed to investigate 
in the same fashion all groups of (/t — 1 ) vectors which can be selected 
from the given set. The largest number of independent vectors which can 
eventually be selected in this manner is equal to the rank of the vector 
set. 

In the light of the discussion in Art. 10, Ch. I, regarding the rank of a 
determinant, the procedure just described may be seen to be equivalent 
to the following statements. From the matrix (f (with m > ;/), all the 
«-rowed determinants are selected. There are as many of these as there 
are m things taken n at a time, and they correspond to the groups of n 
vectors previously mentioned. The highest rank to be found among these 
determinants is the rank of a vector set of Cf. 

In order to appreciate the truth of the latter assertion, it is necessary 
first to observ'e that the independence of (;/ — 1) vectors in an «-dimcn- 
sional space may be established by consideration of all the “ projections 
of these vectors onto the n co-ordinate ‘‘ planes, comprising {n — 1) 
dimensions each. Such a “ projection ” is carried out by simply allowing 
the components of all the vectors along one particular axis of the space 
to go to zero, or be disregarded. Consideration of the problem of two 
vectors in three dimensions will lead to the recognition that a necessary 
and suflicient condition for independence of the (n - 1) vectors is that at 
least one of these n projected sets should be independent. Similarly, to 
insure dependence of the set, all the n projected sets must be deixndent. 
In an identical manner the independence of {n - p) vectors in an n- 
dimensional space is determined by the independence of at least one of the 
group of all possible {n - //)-dimensional “ projections ” of the set, 
formed by striking out p components at a time from all the vectors. If, 
and only if, all such projected sets are dependent, the set itself is de- 
pendent. It is then pointed out that all possible groups of (n - p) vectors 
which can be formed from the original group of m may be regarded as 
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selected by choosing all jx)ssible groups of {n — p) vectors from each of 
the groups of n vectors originally selected. The process of ascertaining the 
highest rank among the n determinants first chosen is then seen to be 
exactly equivalent to finding the largest number of independent vectors 
in the set by a method of successive projections. 

In the tran.s|X)sed vector set of (2 the number of vectors is n and the 
number of available dimensions is m. Still assuming m > n, the rank can 
at most be «, and is equal to the largest number of independent vectors 
which can be forme<l from the given set. The dependence or independence 
of a selected group of vectors is ascertained through noting whether or 
not a relation of the form given by Eq. 2 exists. This process of investiga- 
tion is, however, equivalent to determining the highest rank to be found 
among all determinants formed from the transpose of Q by selecting n 
columns in all possible combinations. These determinants are simply the 
transposed ones of those referred to in the preceding paragraph. Since the 
rank of a determinant is the same as that of its transpose, it therefore 
follows that the vector set of the transpose of Q has the same rank as the 
vector set of Ct. 

The same conclusion holds when m < n and m = n. When m ^ n, the 
rank of the vector set or its transpose is at most equal to the smaller of 
the numbers m and When m = which is the case of greatest practical 
imjwrtance, the definition of the rank of the vector sets agrees with that 
already given for the rank of a square matrix and its determinant.* 

3. Vector significanxe of a linear til\nsformation 

The ideas developed in the preceding articles suggest a useful geometri- 
cal interpretation for the linear transformation 

<^11^*1 + • * * d" ~ Jl 


+ * • * + OrnnXn ^ >’n 

with the square matrix 

dll * • * a\r, 

a = 

[_dr„l ' • • <7nnJ 

In the «-dimensional space occupied by the vector set of Q, two addi- 
tional vectors x and y, with components Xi, X 2 , • • • x„ and yi, y 2 , ■ ■ • yn 
respectively, are defined. The left-hand sides of the equations in the set 3 

•In connection with the discussion of this article it may be of interest also to refer to Art. 6, 
Ch. IV, for an alternative method for esUblishing the linear dependence or independence of 
a given vector set. 


[3] 

[4] 



so 


USE. 1 R TR. I SSEORM. I TIOSS 


(C7/. Ill 


arc then recomh/cd io he the scalar products of the vectors t/i, a-j, ' (in 
respectively and the vector .v. 

The scalar product icf. Art. 2, Ch. V) is e.xpressihle either as the sum of 
the products of corrcs|Muuiing comixmenls of two vectors or as the product 
of their lengths and the cosine of the angle between them. If the given 
vectors are at right angles to each other, their scalar product is evidently 
zero. 

With the -calar product denoted by a dot i)laced between the symbols 
for the veaors tCiibbs's notation), Kqs. 3 may be written in the alterna- 
tive compact form 



These equations are .said to transform a vector .r into a vector y, the 
components of the latter being the scalar products of the vector .v with 
those in the vector set of Cf. 

Any vector .r with given length and direction is transformed into 
another vector y by means of Kqs. 5, the mechanism of the transforma- 
tion being determined by the clu.ster of vectors »7i, (Zo, • • • As the 
length and direction of the vector x are varied at will, the length and 
direction of the vector y vary in a corresp<^nding manner. I'he vectors 
X and y may be visualized as rods emanating from a box which contains 
the mechanism of the transformation as characterized by the matrix Cf 
or its v’ector set. As the rod representing .v is pulled out or pushed in and 
its direction changed, the mcchani.sm in the box causes the rod represent- 
ing y to lengthen or shorten and change its direction in a corresjxznding 
manner. 

For the moment, the mechanism in the box may be assumed to be so 
constituted that each length and direction of the rod representing x 
uniquely determines a length and direction for the rod representing y. 
This is evidently the case if the matrix Cf is nonsingular. Then the trans- 
formation is reversible; that is, the rod representing y may be moved 
about at will, thus causing the mechanism in the lx)x to produce corre- 
spx)nding lengths and directions for the rod x, A given jwint in .space for 
the tip of the rod .r determines one (and only one) point for the tip of the 
rod y, the corresponding points being independent of whether the rod x 
is pushed and the rod y caused to follow or vice versa.* 

*In particular it may be of interest to note that if the vector x is move<I about in a plane, 
the vector y remains in a plane also; and if the tip of the ;c-vector follow.s a straight line, the 
tip of the y-vector likewise fallows a straight line. These matters arc treatwl in greater dcUiil 
in the discussion immediately following I>i. 168 in Art. 10. 
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Viewed in this mechanical manner, the transformation 5 and its 
inverse involve the same mechanism. Algebraically, however, the inverse 
transformation 


b\\y\ + • • • + 6i„y„ = xi 


f>niyi + • • • + b„„y„ = 


[ 6 ] 


written in vector form 


ft, • y = .Y, 
bo ■ y = .V 2 


bti ' y 


[7] 


involves the vectors bi, b 2 , ■ ■ ■ b„, which are the vector set of the inverse 
matrix 


Cf-' = fil = 



[ 8 ] 


Since X Q equals the unit matrix, and is formed through multiplying 
the rows of fB by the columns of (?, it follows that the scalar product 

for i = k 
for i 7 ^ k 


bi • Ok 




[9] 


The vectors a/, a 2 ^ * * • arc the transposed vector set of Q, More 
sjxjcifically, the relations 9 state that bij for example, stands at right 
angles to the vectors whereas its scalar product with 

(/i ' is unity; bo stands at right angles to the vectors ai\ and its 

scalar product with is unity, and so forth. Tliis situation offers an 
interesting geometrical view of the relation between a nonsingular matrix 
and its inverse. 

If the square matrix Q is s>'mmetrical, 


= <^1 
(72^ = a2 

On' = In j 


[ 10 ] 


Then the orthogonal relationships just described hold directly for the 
vector set of Q and the vector set of its inverse. 


4. Orthogonal CO-ORDINATE transformations; orthogonal vec- 
tor SETS 

It sometimes becomes necessary in dealing with physical problems to 
transform from one co-ordinate system to another. With respect to a 
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given reference system whose rectangular itxes arc numbered 1, 2, • • • w, 
the co-ordinates of a particular point may be denoted correspondingly 
by the quantities .Vi, * ■ • .Vn. In a second rectangular reference system 
whose origin coincides with that of the given one, the axes which are 
labeled l', 2', • • • n' have difTerent angular orientations with respect to 
the axes of the original system. This second reference system may be 
thought of as obtained by simply rotating the iLxes of the given system, 
keeping the origin fixed and maintaining the mutual orthogonality of the 
axes. 

The point in space whose projections on the original set of axes arc the 
quantities .Vi. .Vo, • • • .v„ has projections on the ilxcs of the second reference 
system which are conveniently denoted by .v'l, .v'o, • • • .v'„. These are the 
co-ordinates of the same point with respect to the second reference 
system. The process of expressing the quantities .r'l, x '21 * * • in terms 
of .Ti, .V 2 , ’ • Xn is spoken of as a co-ordinate transformation. 

In order for such a co-ordinate transformation to be carried out, the 
relative orientations of the axes in the original and the new reference 
systems must, of course, be known. Algebraically the directions of the 
new axes with respect to the old ones are expres.sed by quantities called 
direction cosines. If the cosines of the angles between axis l' and the 
axes 1, 2, • • • « are denoted respectively by the coellicients (?i 2 » * * * 
the cosines of the angles between axis 2' and the axes 1, 2, • ♦ n respec- 
tively by (721) ^ 22 ) • * • Ozm and so forth, the array of coefficients in the 
matrix 

0\\ 0\2 ' • • Oin ”1 

Ooi O22 ' • 02n r- -T 


Onl On2 ’ ‘ ' ^nn J 

are the complete set of direction cosines of the new axes with respect to 
the old ones. 

The desired co-ordinate transformation is then expressed by the linear 
set of equations 

OiiXi + O12X2 + • • • + OinXn = x' i 
021^1 + ^^ 22^2 + * ' • + 02nXn = 


^nl^l + On2X2 + ’ * ' + = x\ 

In order to appreciate the correctness of this result with regard to its 
general form, and also to clarify the geometrical relations involved, the 
reader may carry through the derivation of this transformation for the 
simple two-dimensional case. 

In the language of the vector interpretation given in the preceding 
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article, Eqs. 12 transform a vector x with the components Xi, X 2 y ' • x^ 
into a vector jc' with the components x'l, x' 2 , * * • Since the origin 
for both co-ordinate systems is the same and only one point in space is 
involved, x and at' are the same vector. It has a different direction with 
respect to the new reference axes from that which it has with respect to 
the old axes, but its length must certainly be the same in both reference 
systems. This fact means that 

^ 1 “ 4“ ATo" + * • • = X^ ^ H- X^ ^ + * • * + X^ r? [13] 

The right-hand side of this equation may be obtained through mul- 
tiplying Eqs. 12 respectively by .t'l, Ar' 2 , • • • and adding. This process 
leads to a double summation. In order to abbreviate the writing of the 
result, it is effective to use the summation sign. Thus Eq. 13 with 12 sub- 
stituted becomes 


Z = Z L OsrXrX\ = Z 

3=lr=l r=l 


[14] 


Reversing the order of summation in the double sum yields 

L ( L a:, = i; avTr [15] 

r = 1 = 1 / r =1 

from which it becomes clear that the stipulation expressed by Eq. 13 
leads to the result 


Z 8 — 

Written out for r = 1, 2, • • • w, this reads 

+ Ooix'o + * • * + 0nix\ = Xi 

022x\ 4" * * * "h 0n2X n ” -^2 


OlnXl + OonXo + 


[16] 


[17] 


“b Onri' n Xn 


This is the inverse transformation with respect to the one expressed by 
Eqs. 12. Its matrix is seen to be the transpose of 0. A matrix whose 
transpose is equal to its inverse is shown in Art. 6, Ch. II, to have the 
properties expressed by Eq. 78 or more sjAecifically by Eqs. 79 and 80 of 
that chapter. 

The direction cosines of the new set of rectangular co-ordinate axes j' 
with respect to the old ones, therefore, satisfy the relations 

_ / 1 for 5 = r 

0»l0rl + 0820r2 + * * * 4“ O^nOrn *" 1 Q for 5 f 

and 

_ / 1 for 5 = r 

+ 02802r 4” ' * + OnsOnr “ ] Q for S ^ f 


[18] 

[ 19 ] 
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In terms of the vector set of 0, that is, ^> 1 . - On and the transix)sed 

vector set • * • ojy these relations may be expressed by the scalar 

products 


Os • Or 



1 

0 


for 5 = r 
for 5 r 


[201 


and 


• Or = 


1 for s = r 
0 for s ^ r 


[ 21 ] 


It thus becomes clear that the vector set of the matrix belonging to the 
transfonnation from one rectangular co-ordinate system to another con- 
sists of a cluster of mutually ortliot^onal vectors of unit length. The same is 
true of the transix)sed vector set, which also is the vector set of the inverse 
matrix. The reason for designating this kind of matrix as an ortlioj^oual 
one is thus evident. The vectors likewise are said to form an orthogonal 
set. 

The relation 9, which is shown in the ])revious article to hold for the 
elements of any nonsingular matrix and those of its inverse, becomes 
identical with Eq. 21 when written for the elements of the orthogonal 
matrix and those of its inverse or trans|K)se. 

As pointed out in the discussion of the orthogonal matrix, the product 

0 X 0/ = 0 X 0“’ = ^^1 [22] 

Since the determinant of the unit matrix is unity, and the determinant 
of 0/ is the same as that of 0, it follows that 

0-^ = 1, or O = ±\ [23] 

A closer examination of the geometr>' involved shows that the value of 
the determinant 0 is -fl if the directional sequence of the vectors 
Oiy 02 y ' ' On corresponds to a right-hand screw rule, anrl it is —1 if this 
sequence follows a left-hand screw rule. If both the reference axes I, 
2, • • • w, and l', 2', • • • n form right-hand or both left-haiul systems, 
+ 1 ; but if one is a right- and the other a left-hand system, O = — 1 . 

; Besides being the matrix of a transformation representing the rotation 
of a rectangular system of reference axes, the orthogonal matrix is 
^evidently also the matrix of a transformation for which a vector x and its 
[transform y have the same length. In other words, a linear transformation 
may represent either a change from one co-ordinate system to another or 
the transformation from one point in space to another with respect to the 
same reference system. If the transformation is an orthogonal one, it 
represents a pure rotation about the origin; that is, a rotation either of the 
co-ordinate system for a fixed point, or of an arbitrary point in a fixed 
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system. In either case, the axes of the reference system are at right angles 
to each other if the transformation is an orthogonal one. 

Such a rectangular co-ordinate system is commonly called a Cartesian 
system, though Descartes (after whom it is named) also devoted many 
of his studies to systems of co-ordinates in which the axes make oblique 
angles with each other. In a three-dimensional oblique reference system, 
the projection (co-ordinate) of a point on a given axis is found by dropping 
a line to this axis parallel to the plane of the other two axes (in a rectangu- 
lar reference system this line is the perpendicular dropped from the point 
in question). An analogous interpretation applies to oblique systems of 
more dimensions. 

Any linear nonsingular transformation may be regarded as a trans- 
formation of the co-ordinates of a point in a given oblique co-ordinate 
system to those of the same point in another system with coincident 
origin, or it may be regarded as the transformation from one point to 
another in the same oblique co-ordinate system. 

An oblique co-ordinate system is also called an affine system. The 
transformation from one such system to another or from one point to 
another in the same system is sometimes referred to as a linear affine 
transformation. The orthogonal transformation is a special case of this 
more general tyjx*. 

5. Tr..\xsfor\l\tion to ax oblique co-ordix.\te system 

In the transformation from one orthogonal co-ordinate system to 
another, the direction cosines of the axes 1' with respect to the axes 
1 , 2, • • • w (these are tlie elements of the first row of the matrix 0, Eq. 11) 
may be regarded as the components (projections) of a unit vector lying 
in the axis l'. This is the vector Oi of the vector set of 0. Similarly the 
vector 02 of this set is seen to be a unit vector emanating from the com- 
mon origin of the two orthogonal co-ordinate systems, coincident in 
direction with axis 2'. The vector set o^, 02 , • • • of 6 is thus seen to be a 
set of coterminus unit vectors coincident respectively with the orthogonal 
axes of the system 1', 2', • • • n\ By a similar line of reasoning, the trans- 
posed vector set Oi^t 0/9 ' — On is recognized as a set of coterminus imit 
vectors coincident respectively with the orthogonal axes of the co- 
ordinate system 1, 2, • • • w. 

This helpful visualization or geometrical interpretation of the co- 
efficients of a transformation matrix may be extended to the nonor- 
thogonal matrix associated with the transformation to an oblique 
(affine) co-ordinate system, but several novel features enter into the 
geometrical interpretations, requiring further detailed discussion. 

From the algebraic point of \iew, the number of dimensions of the 
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space under consideration is arbitrary, but for facilitating the geometrical 
visualization of the argument, reference is made to the two-dimensional 
Fig. 1 . Here there are drawTi three co-ordinate systems — one rectangular 
Cartesian system and two oblique systems. The axes of the rectangular 
system are designated by the letters .Yi and A'2. The first oblique system 
has the axes Si and So, and the second oblique system has the axes S*i 



Fig. 1. The vector x described in equivalent fashion in oblique and rectangular 

co-ordinate systems. 

and S*2 which are at right angles to the axes S2 and Si respectively of 
the first oblique system. The reason for considering the two oblique 
systems S and S’*' (with the relative orientations of axes as just described) 
in conjunction with the rectangular system X follows from the discussion 
below. 

For the rectangular co-ordinate system X, a set of unit vectors Wi, 
«2, • * • if'n is defined which emanate from the common origin O and are 
coincident respectively with the axes 1 , 2 , • • • « of this system, and hence 
are mutually at right angles with each other. A vector x from 0 to P has 
projections on the axes 1, 2, • • • w which are denoted by Xi, 0:2, • • • Xn 
respectively. The vector components of this vector are WiXi, U2X2, • • • 

The vector sum of these components, according to the usual parallelogram 
(or parallelopiped) rule of vector addition, yields the vector x, thus: 

X = UiXi -f U 2 X 2 + • • • + UnXn [24] 
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In order to provide an equivalent representation for the vector x in 
the oblique system H, a set of unit vectors < 22 ) • • is defined, which 
emanate from the common origin O and are coincident respectively with 
the axes 1, 2, • • • n of this oblique system. The projections of the vector x 
(co-ordinates of the point P) upon these oblique axes are denoted by 
fi) f 2 ) • • • fn respectively. The vector components of x in the oblique 
system S are, therefore, aiJi, • • • </n?)i, and their vector sum also 
yields the vector x, that is, 

^ + <22^2 + * • • + <2„{n [25] 

Finally, the vector x may also be represented in the second oblique 
system by defining for it the set of unit vectors a*i, a* 2 y • * • <i*n 
emanating from the common origin O an<l coinciding respectively with 
the axes 1, 2, • • • w of this system. The projections of x upon these axes 
are denoted respectively by ^* 2 * * * * The vector components 
of x in the system Z* are < 2 * 2 i* 2 ^ * * * <2*n$*n, and their vector sum 

yields 

X = -f <2%{*2 + • • • + a^n^n [26] 

The unit vectors iq, « 2 , • • • are regarded as the vector set of a matrix 



Wll 

//,2 • • 


CU = 

2/21 

U 22 • • 

• U 2 


Jinx 

dn2 • • 

• li,u 


in which the elements are the components of the vectors with respect to 
the rectangular co-ordinate system A". According to the definition of this 
vector set, it follows that this matrix is the unit matrix 


qi = 

The unit vectors <2i, < 22 * * • < 
as the vector set of a matrix 

a = 


’1 0 0 • 
0 10- 

• • 0 "' 

• ■ 0 


[28] 

0 0 • • 




In of the co-ordinate system 

S are regarded 

an <^12 * * 
a2i <222 * * 

• a\n 

• <J2b 


[29] 

<2nl <2„2 • * 

* <2nn _ 




in which the elements are the components of the vectors with respect to 
the rectangular system X also. For example, an, ai 2 . • • • dm are the 
projections of the vector ax upon the axes 1, 2, • • • « respectively of the 
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system A'; ^21^ <^22^ • * * are the projections of the vector u{X)n the 
axes 1 , 2, • • • ;/ respectively of the system X, and so forth. 

Similarly the unit vectors u*i, 0*2^ ■ * * co-ordinate system 

S* are regarded as the vector set of a matrix 

“a*ii 

fl* = [30] 

a*u2 ■ ■ ■ 

in which the elements again are the comiK)nents of the vectors with 
respect to the rectangular system A'. That is. (7*n, a*i2, • * * are the 
projections of the vector a*i ujwn the axes 1, 2, • • * ;/ respectively of the 
system A'; 0*21, (1*221 * * * ^*2n are the projections of the vector a*2 upon 
the axes 1, 2, • • • n respectively of the system A', and so forth. 

According to the definition of the systems Z and Z*. the axis 1 of Z is 
orthogonal to the axes 2, 3, • • • w of Z*; the axis 2 of Z is orthogonal to the 
axes 1, 3, • • ■ » of Z*, and so forth. Hence the scalar products of Ui with 
the vectors a*2y CL*^y • • • CL*n are zero; the scalar products of the vector tf2 
with the vector a*i, a*3, • • • <2*„ are zero, etc. 

At this point a peculiarity of affine geometiy' (usually confusing to 
engineers) must be clearly understood. A distinction must be made be- 
tween the scale by means of which any length is measured and the scales 
which are attached to the oblique co-ordinate iLxes. The “ scale of 
length ” is that which is attached to the rectangular Cartesian iixes X 
(this is the same for all the axes 1, 2, • • • zz in this system). The oblique 
axes carry their own scales but they are not used to measure length. 
The system Z may in general have a different scale for each of its axes, 
and they are different from the scales which are used to lay off the units 
on the axes of the system Z*. The unit vectors Uj, Uot * * are units 
according to the scales for the axes of the Z system, and the unit vectors 
(i*1j d*2i * • • (i'*n are units according to the scales for the axes of the Z* 
system, but none of these unit vectors in general has unit length according to 
the scale of length/' which is the scale for the units laid off on the axes 
of the system X. 

The reader should appreciate that a scale on a co-ordinate axis may, 
in general, have a twofold purpose: 

(a) It may be used to detenrune the value of the projection of a point 
upon that axis. 

(b) It may be used as a tape measure is used to determine the distance 
between two points in space. 

A scale may be used for either purpose alone, or for both. Use (a) is 
the one with which the reader is undoubtedly most familiar in connection 
with his analytic work. For example, when plotting a function y = f{x)y 
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scales are employed for the x- and y-axes which seldom have the same 
size of unit, and are used merely to read values of x and y, not also to 
measure length or distance. 

Interest in the measurement of length arises only when geometrical 
considerations enter into the problem. When they do, one must have a 
“ scale of length ” (tape measure) in addition to the various scales whose 
use is restricted to the determination of the values of projections. In the 
present problem the identical scales carried by the axes of the system X 
ser\'e also for the measurement of length. I'his designation haxing been 
made, it is obvious that no other scales (except if they are identical with 
those of the A'-.system) may be used for the measurement of length. 

For curvilinear co-ordinates the situation is even more confusing, since 
the scales which the co-ordinate axes bear not only are in general different 
for the different axes of the same system, but also vary from point to point 
along these iuxes, whereas the scale of length is independent of the co- 
ordinate system and independent of the location within any system. As 
long as the oblique axes arc linear, their scales are the same for all points 
within the system. 

The units for the scales carried by the axes of the oblique systems may 
hav'e their lengths (as measured by the scale of the A'-system) so adjusted 
that the scalar products of the unit vectors Ui, (/.>, • • • < 7 ,, respectively with 
the unit vectors < 7 * 1 , are all unity. (I'his adjustment is called 

normalization.”) For example, if the angle between ( 7 i and ( 7 *i is ai, 
the lengths of < 7 i and ( 7 *i (that is, ai and ( 7 *i ) are determined so that 
|t 7 i| X 'u*i* cos ai equals unity. Since ai is an arbitrary angle, it is clear 
that although the length of either ( 7 i or ( 7 *i can be chosen equal to unity, 
both unit vectors certainly cannot have unit length. 

As a result of this normalization process and the orthogonality between 
unlrke numbered axes in the systems H and H*, the sets of unit vectors 
in these systems satisfy the following conditions- 


• a*k = 


1 for i ~ k 
0 for i 9 *^ k 


[31] 


in which the scalar product is indicated by a dot. In terms of the matrices 
29 and 30 these conditions are expressed by the matrix equation 


1 

ai2 • 

022 * 

* Ojn 

* 02n 

\ 

X 

a* \\ 
0*12 

0* 21 
0*22 

1 

0 nl 

0 n2 


“1 

0 

0 

1 

• 0 0 

• 0 0 

J 

L^nl 

<^n 2 * 

* * 

1 

.0*ln 

<i*2n 



0 

0 


.J 


[ 32 ] 



90 


UN EAR TRANSFORMATIONS 


[Ck. Ill 


which is equivalent to 

a xGt*i = ^ [33] 

Hence it follows that 

or fl* = Qr^ [34] 

The matrix Q* is, therefore, the reciprocal of C?. The vector set a*i, 
(7*2) * * * is called the reciprocal of the set Ui, (72, • • • and the oblique 
co-ordinate system H* is referred to as the reciprocal of the oblique 
system E. The present result yields a useful geometrical interpretation 
for the elements of a square matrix and those of its reciprocal. Inciden- 
tally, it may be recalled in this connection that an orthogonal matrix is 
its owTi reciprocal, and this conclusion checks with the fact that an 
orthogonal co-ordinate system is also its own reciprocal. 

By use of one of the three equivalent relations for the vector x given by 
Eqs. 24, 25, and 26, the length of this vector may be expressed in terms 
of the co-ordinates of the rectangular system -V, or of either of the oblique 
systems S or **'. The square of this length is evidently given by the 
scalar product of the vector .v with itself. Using Eq. 24, one has 

\x\^ = X • X = (UiXi + 142X2 + • • • -f UnXn) 

• (WiXi + U2X2 + • • • + [35] 

Because of the mutual orthogonality of the unit vectors, the scalar 
product of any vector with any other in the set is zero, whereas this 
product of any vector with itself is unity. Hence in terms of the co- 
ordinates of the rectangular system A", 

+ V + • • • + [36] 

which is the familiar Pythagorean proposition extended to n-dimensional 
space. 

This result may be expressed in terms of the co-ordinates fi, $ 2 ) * * * fn 
of the vector x in the oblique system H by first determining the expres- 
sions for xi, X 2 , • • • Xn in terms of the |’s, and then substituting these into 
Eq. 36. A general expression for the components Xj, X 2 , • • • Xn is evidently 
given by 

Xk = Uk -x for ^ = 1, 2, • • - n [37] 

which states that the projections of x on the axes 1, 2, • • • « of the system 
X are the scalar products of x with the unit vectors «i, M 2 , • • • m„ of this 
system. 

Substituting Eq. 25 into Eq. 37 yields 

Xk = ttk' fllfl + • 02^2 + •••+«*• ®nfn [38] 
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Uk • ai = aik [39] 

because the scalar product of Uk with ai equals the projection of the 
vector a, upon axis k of the rectangular system X. Hence Eq. 38, written 
out for ^ = 1, 2, • • • w, reacis 


= Ollil + 021^2 + • * • 4 - (Inlin 
Xq = 4 - 022^2 4 - • • • + CLn2^n 


Xn = «ln{l 4- fl2nf2 4" * * * 4" 


[40] 


This set of equations may be written in matrix form by defining the 
column matrices 


and 


x] 


Xi 


Xn 


(l 

(■2 




[41] 


[42] 


With the transpose of the matrix 29, Eqs. 40 evidently are expressed by 


•r] =a,i] 


[43] 


Substitution of this result into Eq. 36 is now greatly facilitated by 
noting that 

f;;l 


1^:1* = *]« X *] = [*iar2 • ■ • x„] X 


= xi^ + X2^+ ■■■ +x„^ [44] 


Thus the square of the length of the vector x, expressed in terms of the 
co-ordinates of the oblique system S, becomes 

\x\^ = €]« X QQ, X f] 


[45] 
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Here it is effective to let 



’^11 

gl2 • 

* gin 

9 = cf X a, = 

^21 

g22 * 

* g2n 



gn2 • 

‘ ' gnn _ 


The elements of this matrix are evidently expressed by the scalar products 

gik = • <U- [*^7] 

from which it is clear that the matrix S is sx'mmetrical. 

Equation 45 may now be written out, as follows: 

ki’ = c?llsl“ + + * * • + AMnflJn 

+ i?2lf2sl + J?22^2” + • • * + f^2n(2in 


+ gnlinsl + g«2?«s2 + * * * + gunin' [48] 

In the oblique system H, the length of the vector .v is fiot simply equal to 
the sum of the squares of its comj)onents in that system, but the result 
expressed by Eq. 48 shows that all cross-product terms are present as 
well. Because of the symmetry of the 8 matrix, which is expressed by 

g<k = Ski [49] 

it is clear that all the cross-product terms in Eq. 48 occur twice. This 
fact makes possible some condensation of the form of Eq. 48. 

If all the scales for the axes of the Z system are chosen equal to the 
scale of length (as can be and usually is done), the unit vectors have 
unit length. It then follows that all the elements on the principal diagonal 
of the matrix 8 become equal to unity. If the angle between the axes 
i and k in the Z system is denoted by 0,ky the result in Eq. 47 shows that, 
for this choice of scales, the elements of 8 are given by the simple formula 

gik = cos dik [50] 

which yields unity for i = k. 

The determination of the matrix 8 is intimately connected with the 
choice of a scale in terms of which a length may be measured in the 
oblique co-ordinate system. Length in this system is defined by the matrix 
9. The latter is the matrix of the so-called fundamental metric tensor 
whose components are the elements gik- 

The present discussion should be recognized as fundamental to tensor 
algebra also, even though we are not concerned here with that closely 
related subject. Matrix equations and tensor equations, in fact, differ 
only in notation. The geometrical interpretations of them, as far as this 
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discussion is concerned, are identical. Tensor notation is indeed some- 
what more advantageous than matrix notation for the operations now 
under consideration. The view taken in this book, however, is that 
matrix algebra is an invaluable aid to clear comprehension of the tensor 
method, and that the student will more readily assimilate that method 
once he has understood the essential ideas underlying the present dis- 
cussion. Proper grasp of the tensor method comes not from the mere 
acquisition of a set of manipulative rules but rather from recognition that 
tensor algebra is a symbolic representation of geometrical and physical 
ideas. 

The square of the length of the vector x given by Eq. 36 may also be 
expressed in terms of the co-ordinates of the reciprocal oblique system 
Thus, using Eq. 26, the co-ordinates of x in the rectangular system 
arc, according to Eq. 37, given by 

.Vjt = Uk • -f life • + •••+«&• [51] 

Here it is recognized that the scalar product 

lif: • a*i = a*ik [52] 

so that Eq. 51 written out for k = 1, 2, • • • w reads 

Xi = u’''ns*l + + * • ' + Cl* nl^*n 

X<2 = (I* 1 2s* I + Cl* 22s* 2 + • * • + ^^*n2^*n 


Xn = -f a*2ne2 + * ‘ [53] 

With the definition of the column matrix 



and the transpose of the reciprocal matrix 30, Eqs. 53 are given in matrix 
form by 

.v] = a*te] [55] 

Utilizing again the relation expressed by Eq. 44, the square of the length 
of the vector x in terms of the co-ordinates of the reciprocal oblique sys- 
tem reads 


2 = j*], X a*a*t X {*] 


[56] 
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Here it is effective to introduce the matrix 



6*12 • 

■ • 6*ln 


6*22 • 

' S*2n 


6*»2 • 

. . P* 

6 nn 


The elements of this matrix are given by the scalar products 

g*iu = «*. • = g\i [58] 

In view of Eqs. 34 and 46, it is recognised that 

9* = ffr' X (f~* = (C? X Qt)-^ = [59] 

The matrix 9*, which is fundamental to the measurement of length in 
the reciprocal oblique system, is the inverse of the matrix 9. Hence 

^ [60] 
C/ 

in w^hich G is the determinant of 9, and Ga- its cofactors. Alternatively, 


in which G* is the determinant of 9* aiifl its co far tors. Evidently, 
by Eqs. 46 and 57, 

G = /12 = [62] 

A representation for the elements g*ik similar in form to that given by 
Eq. 50 for g.jt is, according to Eq. 58, 

fik = X \a*k[ cosd*^k [63] 

in which [a*,] denotes the length of a unit vector in the reciprocal co- 
ordinate system H*, and d*ik is the angle between the axes i and k in this 
system. 

In general, the lengths* of the unit vectors in the oblique systems are 
given by the expressions 

|ai| = V flit* + a, 2 * + • • -f a,„2 = [64] 


and 

\a*i\ = + a*i2^ + • • ■ + = \/]r. [ 55 ] 

*The term “ magnitude ” would actually be more appropriate here than “ length ” becau>'e 
the unit vectors ai and a*i, like those for the rectangular coordinates, are assumed to be 
without dimensions. 
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When the former are chosen equal to unity, the latter are in general all 
different from unity; that is, each axis of the S* system carries a different 
scale. 

The result given by Eq. 56 for the square of the length of the vector x 
in terms of the co-ordinates of the reciprocal oblique system, when 
written out, reads 

1x1* = 2 + • • • + 

+ ^*2lf*2{*l + g*22?*2* + • • • + g*2n{* 2?* n r^^-i 


+ g*n , + ••• + fnnfn^ 

Again all cross-product terms (double products because gik = g*ki) are 
present in addition to the square terms. 

Still another form for is possible if in Eq. 44 both of the alternative 
expressions 43 and 55 for x] are substituted, one for x]t and the other 
for x]. Then the result reads 



|a-1* = X m*, X n 

[67] 

or 

I*!* = e] X x ?] 

[68] 

But by Eq. 33 

axQ*, = Q*xQ, = 

[69] 

so that 

N^=f]iXf*] = «*],Xf] 



= f.C. + f2C2 + • • • + f.fn 

[70] 


In this expression, no cross-product terms are present. It closely 
resembles the simple Pythagorean form given bv Eq. 36, the only dif- 
ference being that products of corresponding con.ponents of the vector x 
in the two oblique systems appear in place of tiie squares of one kind of 
component. 

The {jfc are called the contravariant components and the the co- 
variant* components of the vector x. The contravariant components 
are the components of x in the oblique system H, and the covariant com- 
ponents are those in the reciprocal system S*. Note in this connection 
that, for the two oblique co-ordinate systems, the property of being 

•These names are chosen with regard to the manner in which the components behave 
when subjected to a co-ordinate transformation. Thus the sets of variables and 

{*1 * • • are said to be contragredimi because when one set is subjected to a nonsingular 
linear transformation, the other is subjected to a linear transformation with the reciprocal 
matrix. For the moment, these matters need not be of interest, but they may be demon- 
strated from Eqs. 92 and 97 which follow. 
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reciprocal is strictly a mutual one. Hence the may just as well be 
looked upon as the contravariant components of in which case the 
fit’s become the covariant components. 

According to Eqs. 43 and 55, 

[71] 

Hence, by Eqs. 34 and 46, 

3a,f] = 8f] = f*] [72] 

It is thus seen that the covariant and contravariant components of the 
vector .V are related by the matrix 8. 

Another interesting pair of relationships is obtained through forming 
the scalar products • .v and * v, using for .r the Eqs. 26 and 25 
respectively, and noting the conditions .^1. d'his procedure gives 

(Jk • = s*c [73] 

and 

a* t • -v = fA- [74] 

With reference to Fig. 1, • ic for ^ = 1 and 2 are the orthogonal 

projections OM and OX of the vector .r uix)n the a.xes and S-j. 
Equation 73 states that the lengths of these projections, measured with 
the scales of the Ei and E 2 axes, are numerically ecjual to the covariant 
components f*i and f *2 of and may be substituted for these in Eq. 70 
in evaluating the length of the vector x. Similarly, a*k • xior k — I and 2 
are the orthogonal projections OM* and OA * of the vector x upon the 
axes and Equation 74 states that the lengths of these projections, 
measured with the scales of the and H *2 axes, arc numerically equal 
to the contravariant components fi and {2 of x. 

In view of the relation 73, the co variant components of the vector x 
are sometimes designated as the orthogonal projections of the iK)int P in 
Fig. 1 upon the axes 1 and 2 of the system H (as contrasted with the 
parallel projections which are the contravariant components). Although 
this procedure is justified numerically, and does away with the necessity 
of considering the reciprocal axes (or drawing them in the case of a 
graphical determination), it fails to give the true geometrical picture 
regarding the nature of the covariant components. 

The scalar product of two vectors x and y in terms of their co-ordinates 
in an oblique system is readily determined with the aid of the above 
considerations. The components of y in the systems AT, S, and S* may be 
denoted by ykj vky and rj*k- The corresponding column matrices which 



Art. 61 TRANSFORMATION IN OBLIQUE SYSTEMS 97 

represent these components are 



~yi' 





y] = 

y2 

-»]= 

V2 

v*] = 



jy^^ 




1 

p* 

I 


The matrix Equations 43 and 55 written for x and y 

x] = = a*,?*] [76] 

y] - Cf,,] = a\r,*] [77] 

admit of four alternative representations for the scalar product 

xy = x]tX y] [78] 

They are 

* • >- = i]f X (?(3, X 7,] = f], X 8 X >?] [79] 

= {*], X a*Q*, X V*] = X 9* X V*] [80] 

= {], X QQ*. X ,*] = {]<,*] [81] 

= {*], X G*Q. X ,] = {♦]„] [82] 


The first two of these results are similar in form to the Eqs. 48 and 66 
except that the terms are bilinear in and rjk respectively f*, and rj^k) 
instead of being quadratic in (respectively $*,). The last two are the 
mixed forms, which read 

X ' y = (iv* I + i2V* 2 + * * • + inV*n [83] 

X ' y — + $*2^2 + • • • + f* nVn [84] 

The first of these involves the contravariant components of x and the 
covariant components of y; in the second fomi these two types of com- 
ponents are interchanged. 

Except for the appearance of the two kinds of components, the ex- 
pressions 83 and 84 parallel the customary one in rectangular co-ordinates, 
namely 

A* • y = Aiyi + A2y2 + • • • + Xnyn [85] 

In a rectangular (orthogonal) co-ordinate system, the covariant and 
contravariant components are identical, since the system is its own 
reciprocal. 

6. Transformation from one oblique system to another 

In addition to the oblique co-ordinate system characterized in terms 
of the rectangular system X by means of the matrix Cf, a second oblique 
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system may be considered as characterized by a matrix vj 6 with the set 
of unit vectors 

hk = U\bk\ + ^2^*2 + * • * + (A’ = 1, 2, * * * u) [86] 

The reciprocal of this second oblique system has the set of unit vectors 
b* k — ihb*k\ 4 - U 2 b*k 2 + • • • -f Unb^kn (A’ = 1 , 2 , • • • n) [ 87 ] 


which is the vector set of the reciprocal matrix fB*, that is, 


bi • b^k 


I for / = k 
0 for / k 


[ 88 ] 


The contravariant and covarianl comj^)ncnts of the vector .v in the 
second oblique system are denoted by u- and with the column 
matrices i*] and f*]. Then the vector x, in terms of its comix)nents in the 
rectangular system A', has the following representations: 


x] = cf.^] = Q\e] 


[89] 


These relations readily yield the transformations from either of the 
variables ft or to either of the variables {“a or associated with the 


second oblique system and its reciprocal. 

For example, from the matrix equation 

a, I] = 

[90] 

premultiplication by 

[91] 

gives 

Sfl, X f] = f*] 

[92] 


which relates the contravariant components of x in the first oblique 
system (these are the to the covariant components of x in the second 
oblique system. The resultant transformation matrLx in this case is 

e = £B X flf [93] 

If this is written out as 

j C\i Ci2 • • Cin 
P _ ^21 <^22 * • • C2n 


L^nl I'n2 ’ * I'nn 

With the vector 

Ck = UiCkl + U2Ck2 + * • * + UnCkn = 1, 2, • • • w) [95] 


[94] 
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then it is clear from Eq. 93 that the elements of C are given by the scalar 
products 

Cik = bi • ak [96] 

Hence the elements of the resultant transformation matrix in Eq. 92 
are seen to be the orthogonal projections of the unit vectors ak upon the 
axes of the second oblique system (measured with the scales belonging to 
the axes of this system), or alternatively they may be considered as the 
projections of the unit vectors bk upon the axes of the first oblique 
system, measured with the corresponding scales carried by these axes. 

The transformation 92, with its resultant matrix determined as just 
described, transforms the contravariant components of x in the first 
system into covariant components in the second oblique system. 

Because of the mutual character of the relation between an oblique 
co-ordinate system and its inverse, as |X)inted out in the previous article, 
Eq. 92 evidently remains true if is replaced by and simultaneously 
r is replaced by f*. This shift gives 

X «] = f] [97] 

which is an equation relating the contravariant components of x in the 
first obli(jue system to the same kind of components in the second system. 
The elements of the resultant transformation matrix are determined and 
interpreted geometrically just as for the transformation 92 except that 
the reciprocal of the second oblique system with its unit vectors b*k 
replaces the unit vector bk with its corresponding axes. 

Further detailed discussion of these transformations becomes decidedly 
awkward in terms of matrix notation, as the reader can at this point 
readily appreciate by continuing the formation of various additional 
obvious relationships. Indeed, it is tliis circum.:>tance which is the best 
justification for the introduction of the notation and the conventions of 
tensor algebra. The most important point in this notation lies in the 
designation of components in the reciprocal co-ordinate systems (co- 
variant quantities) by louver indexes (subscripts) and those in the given 
co-ordinate systems (contravariant quantities) by upper indexes (super- 
scripts) instead of by the placing of an asterisk on these quantities. 
The circumspection which results from this simple artifice alone may 
readily be appreciated even by the reader who has as yet no familiarity 
with the tensor notation. 

However, a detailed discussion of these matters at this point would 
lead the reader too far afield from the present objectives, which are to 
lay a general foundation for a more thorough consideration of this subject 
at a later time. 
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7. Systems of linear algebraic equations 


As a result of the geometrical interpretations given in the preceding 
articles the problem of finding solutions to a set of linear simultaneous 
algebraic equations may be discussed with a greater degree of clarity 
than is possible from a purely algebraic point of view. To start with, the 
number of equations is considered equal to the number of unknowns. 
The equations read 


with the matrix 


1 Ti + ai 2 X 2 4" 

4“ 022^2 


4“ 

4- a2n-^‘n 

= vi 

= V2 

[98] 


i2*^'2 4- 


4“ OnfiXfi 

= yn 




(I 12 




a = 

aoi 

(122 

* ' * ^2n 


[99] 



^n2 

a nn _ 




The components a, it, a*it, and yk of the vector set and the vectors x 
and y respectively for ^ = 1, 2, • • • n are the ordinary comix)ncnts with 
resp)ect to a rectangular Cartesian co-ordinate system in an n-dimensional 
space. 

The vector set a, (for f = 1, 2, • • n) and the vector y are specified, 
and the problem is to find the corresponding vector x. The problem of 
particular interest according to the discussion in Art. 9, Ch. I, is the 
question whether a vector x can be found to satisfy the transformation 
98 when y is specified to be zero. In other words: Can the transform of a 
vector X be zero? 

In this case Eqs. 98 become homogeneous. In vector form they read 

di • X = 0 

a2 • X = 0 


• X = 0 


[ 100 ] 


According to these equations, the vector x (if one exists) must be simul- 
taneously orthogonal to all the vectors ai, a 2 , • • • an in the vector set of d. 
The possibility of the existence of such a vector evidently depends upon 
the rank of this vector set. If the rank is n, then, according to the dis- 
cussion in Art. 2 above, the vector set utilizes all the available n dimen- 
sions, and it is clear that no vector x can be found to satisfy the conditions 
100. In this case the determinant of d is not zero. 

On the other hand, if the rank of the vector set of (i is (n - 1 ), one of 
the available n dimensions is left unoccupied. For example, if in three- 
dimensional space the vectors oi, fl 2 » and a^ lie in a plane, only two dimen- 
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sions are utilized by this vector set, and any vector x normal to this plane 
constitutes a solution to the homogeneous Eqs. 100. The vanishing of the 
determinant of (2 is seen to be the necessary and sufficient condition for 
the existence of such a solution. 

At the same time it is also clear that the solution merely specifies a 
direction for the vector .v; its length is arbitrary. This checks with the 
conclusion reached in Art. 9 , Ch. I, on purely algebraic grounds, according 
to which only the ratios of the unknowns JCi, 0:2, • • • :rn to each other are 
determined from the homogeneous equations. The direction cosines of 
the vector .v are given by the expression 


Xk 


Xk '' 


\/r,- + .v..>- + ■ • • + x„^ 


/V 


Xo“ 

Xk' 


+ -2 
Xk 


[ 101 ] 


for it = 1, 2, • • • «, and this expression involves only the ratios of the 
unknowns. 

If the rank of the vector set of (i is in — 2 ), two of the available dimen- 
sions are left unoccupied. For example, if in three-dimensional space the 
vectors Ui, tij, and U3 lie in a straight line, any vector x lying in the plane 
normal to this line is simultaneously orthogonal to all three vectors in the 
set, and hence any such vector constitutes a solution. Although an 
infinite number of .r-vectors may be drawn in this plane, only two inde- 
pendent ones can be found in this way. Hence when the rank of the matrix 
(2 is (w — 2), tu'o indeixjiident solutions exist to the set of homogeneous 
Eqs. 100. 

In general, then, if the rank of (2 is (w — />), the homogeneous equations 
have p independent solutions. In any case, however, the length of the 
vector X* for any of the solutions remains arbitrarx"; and if the rank of 
Q is r < «, the equations determine only r of the Xk% in terms of the 
remaining ones. Choosing arbitrary sets of values for these remaining 
Xk^ permits any number of solutions to be obtained, of which no more 
than p can be linearly independent. The A*-vectors corresponding to p 
independent solutions are in general not mutually orthogonal to each 
other, but a set of /> mutually orthogonal solutions may be found by 
proceeding in the following manner. 

If the vector set ai, a2, • • <7„ has the rank r, it is surely possible to 
find f linearly independent vectors in this set. The numbering in the 
original set may be changed if necessary so that the vectors ai, a2, • * • flr 
are the linearly independent ones. Then the r equations 

filial + ^12^2 H h ain:r„ = 0 

<^21^1 + <^22^2 + * • • + <^2n^n = 0 11021 


^rl^l 4" (Ir2^ -j- . . . drn^n 0 
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uniquely determine r of the Xk's in terms of the remaining ones. That is, 
a set of values 

xt = otik (for ^ = 1, 2, • • • ») [103] 

of which p = n — r are arbitrarily chosen (not more than ^ — 1 of these 
may be chosen equal to zero, however) may be considered to be the 
components of the first x-vector constituting a solution. This is the vector 
X = ai with the comfX)nents aik- 

Now the vector set • • • ar is augmented by having the vector 
associated with it. This augmented set is surely an independent one, 
because ai is orthogonal to all the vectors ai • • • ar* Hence the r -f 1 
equations 

+ ^12^*2 -f- • * * 4* ainXn = 0 


arl.Vi -f + • • • + OrnXn =0 ^ 

<3^1 iXi -}- ^12^*2 “h * * * 4* OCinXn = 0 

uniquely determine r 4- 1 of the XkS in terms of the remaining ones. 
That is, a set of values 

Xk = a 2 k (for ^ = 1, 2, • • * w) [105] 

of which — 1 are arbitrarily chosen (not more than — 2 of these may 
be chosen equal to zero) represents the components of a vector .v consti- 
tuting a second solution which is orthogonal to the augmented vector set 
ai • • • Grj ai, and hence orthogonal to the first solution x = ai. This 
second solution is the vector x = a 2 with the components a 2 k^ 

A third vector x = aa with the components a^k is found in like manner 
by solving the set of r 4- 2 homogeneous equations corresponding to the 
linearly independent vector set aj • • • ar, ai, a 2 - These solutions are the 
components 

Xk = ask (for ^ = 1, 2, • - ») [106] 

of which ^ — 2 are arbitrarily chosen. 

A continuation of this process finally yields a set of mutually orthogonal 
vectors ai, as, •• • ap, all of which are orthogonal to all the vectors in the 
set ai, 02 , • • • Or. Such a set of mutually orthogonal solutions is, however, 
not unique, because of the arbitrary choice of />, p — 1, — 2, etc., 

Xifc- values at the various steps in the process of solution as described above. 

The case for which the rank of the vector set of (2 is r = n — 1 is most 

frequent in its occurrence in practice. Here only one independent solution 

(direction for the x-vector) exists. Because of the relation expressed by 
Eq. 46 of Ch. I and because of the fact that ^4 = 0 for r = w - 1, the 
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following relationship holds in this case (for all values of i and k from 
1 to n) 

O'ilAkl + Ui2'4ik2 + * * * + O^inAkn = 0 [107] 

in which A kB are the cofactors of the elements a*, formed from the de- 
terminant A of the matrix Q. 

If A k is defined as a vector having the components Akf^ the relation 
107 in vector form reads 


di Ak = 0 


[108] 


which states that the vector is simultaneously orthogonal to all the 
vectors in the set a 2 > * * * ^n- This vector, therefore, constitutes the 
desired solution. 

Note that the cofactors Aka for 5 = 1, • • • n are formed for the ele- 
ments of the ^th row of (2. Any row may be chosen. If, however, all the 
cofactors of the elements of that row happen to be zero (as is possible 
because the rank is w — 1), the solution assumes an indeterminate form. 
However, since the rank is assumed for this case to be not less than » — 1, 
a row can surely be found for which not all the cofactors of the elements 
are zero. 

The direction cosines of the vector x constituting the desired solution 
are given by 


cos 6g 




+ Aji-o* + • • • + Akn^ 


[109] 


in which dg is the angle between the direction of the vector x and the axis 
s of the rectangular co-ordinate system. 

Another problem of special interest in connection with Eqs. 98 occurs 
when the rank of the vector set of Q is less than n, but the y, -values on 
the right-hand sides of these equations do not l.appen to be zero. That is, 
the equations are inhomogeneous but the determinant of their matrix is 
zero. In Art. 9, Ch. I, it is pointed out that solutions may still exist if the 
y»’s have values satisfying Eq. 56, which reads 

y.- = ai«il + «2<3!»2 + • • ' + <Xndin 2, * • • w) [110] 

but that the solutions assume an indeterminate form when expressed 
according to Cramer’s rule. 

The aifc’s in Eq. 110 are arbitrary factors. In vector form this equation 
reads 

y = (Xidi^ “h a 2 U 2 ^ + * * • + oitidn^ [m] 

and states that the vector y is given by a linear combination of the 
transposed vector set of Ct. By making use of the relations 110, Eqs. 98 
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may be rewritten in the form 

<^11-^* 1 + <^12*^* 2 "h * ‘ * 4* = 0 

4- </22-V^2 4” * * * 4- (l2nX\ — 0 

+ <^n2^''2 4" • * • 4- = 0 

in which 

x'k = -Va- Otk 

Or if ai, ao^ * • • are considered to be the comix>nents of a vector a, 
and x'l, a'o, ■ • • a*'„ the components of a vector x\ 

x' = X — a and x = a:' -h a [114] 

The vector x' is a solution to the homogeneous equations 112. The 
problem of finding solutions a*' when the rank of Cf is r < « is discussed 
above, and the desired values of the vector a* are given in terms of those 
for A'' by Eq. 114. 

When the vector y, in this type of problem (for which r < w)> cannot 

be expressed in the form of Eq. Ill (in this case y is not a member* of 

the transposed vector set of Cf), the original set, Eqs. 98, is said to be 
inconsistent, and no solutions are jxissible. 

It is interesting that the discussion of this special problem incidentally 
offers also an alternative interpretation to the solution of the general 
inhomogeneous equations 98. This case is contained in the solutions to 
the homogeneous equations 112 for a' when the rank of (i is n. I'hen the 
application of Cramer’s rule yields a*' = 0; that is, a*'i = a *'2 = ■ • • a‘^„ = 
0, and the desired solution for x follows from Eq. 1 14, namely, for r = 

A: = a [115] 

In this case, Eq. Ill yields the following representation for the vector y: 

y = XiOi^ 4- X 2 a 2 ' + • • * 4- [116] 

which is an interesting alternative form for the linear nonsingular trans- 
formation. 

It appears from this discussion that the necessary and sufficient con- 
dition for the existence of solutions to the inhomogeneous equations 98 
is that the vector y be a member of the transposed vector set a/, 02 , 
• • • an\ regardless of the rank of Q. If the rank is «, then y must neces- 
sarily be a member of the transposed vector set (no matter what the 
values of yi • • • yn may be) because the rank of the augmented set 
(h\ * * * y cannot be greater than n (see footnote below). Then 

•If the rank of the transposed vector set and of that set augmented by the vector y are 
alike, then y is called a member of the transposed vector set, and hence is expressible in the 
form given by Eq. 111. 


[ 112 ] 

[113] 
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there exists a unique solution for x which is given by Cramer’s rule. 
When the rank of (2 is less than «, no solutions exist unless y is a member 
of the transposed vector set of (2, in which case n — r linearly independent 
solutions can be found. 

A summary statement regarding the possibility of finding solutions to 
Eqs. 98, known as the rule of alternatives, may now be formulated, as 
follows: 

Either the matrix (2 of Eqs. 98 has the rank n, whence A 0, and 
there exists one set of values ^i, X 2 , • • • which satisfies the equa- 
tions for arbitrary values of Vi, y^,* • Vn — in particular the trivial 
values .Vi = x? = • • • = = 0 resulting for yi = y 2 = * * * = 

y„ = 0 - these solutions being given by Cramer's rule, 

or the matrix (? has the rank r — n — p ip 7 ^ 0 ), whence A == 0, and 
the corresjx)nding homogeneous equations have p independent 
solutions. The inhomogeneous equations then have solutions for 
particular values of yi. y 2 , • • • Vn only, namely those values for which 
the vector y is expressible as a linear combination of the transposed 
vector set of (2. To these particular solutions may be added those for 
the corresponding homogeneous system. 

The more general case of m equations with n unknowns 

<^11-^1 ■+■ + * • • + = yi 

+ * • * + <?2n-V,i = Vo ril7l 


~h ^^w2'^2 -f- * • * T* OjjiriXn ym 


may now readily be disposed of. Multiplying the equations respectively 
by the unit vectors «i, Uo, • • • Wm associated with the m axes of a rec- 
tangular co-ordinate system, and adding, yields the result 


y = .v'ld/ -f a* 2 U 2 ^ "b • ' 

• • r x„an‘ 

[118] 

where 



y = uiyi + W2y2 + • • 

• + Umym 

[119] 

and 



<^k = WiJijfc + U2(l2k + • 

• • + Umfimk 

[120] 


are the vectors in the transposed set of the matrix (2 of Eqs. 117. 

The result given by Eq. 118 states that for any set of values Xi, X 2 , 
• • Xn which simultaneously satisfies the m equations 117, the vector y 
is a member of the transposed vector set of 3. In other w'ords, unless y 
is a member of this vector set no solutions are possible, regardless of the 
rank of the set. This condition, which is arrived at above for the special 
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case m = », is thus seen also to be the condition for the existence of solu- 
tions in the more general case m 7 ^ n. 

When m 7 ^ n this condition is more significant, for if the rank is n 
and m = w, the condition 118 is automatically fulfilled; whereas when 
m > n, for example, and the rank is w, y may or may not be a member 
of the transposed vector set- In other words, when m = n and the rank 
is «, there is no question about the existence of solutions to the inhomo- 
geneous equations; but when m 7 ^ n and the rank is n{<m) or ;;/(<;/), 
the existence of solutions still depends u[K)n the fulfillment of the con- 
dition 118. 

If this condition is written 

y — Oritj/ + (XoCl'y^ ^ ^ OtnQn [^21] 

and oti, 02 , • • • otn are regarded as the comjxments of a vector o, then by 
letting 

jr' = .r-a x = .r' + a [122] 

as in the problem for m = «, the question of solutions to Eqs. 117 may 
be discussed for the inhomogeneous and the homogeneous cases simul- 
taneously, because the resulting equations 


dliX 1 + (/lo.X 2 “H * ' 
' 1 ' 1 
^21-^* 1 1 (^22'^ 2 1*' 

• • + flln/n = 0 
’ • + = 0 

[123] 

aml^\ + (lm2^^ 2 + * ' 

’ ’ “1” n ~ ^ 



apply to the inhomogeneous equations 117 for a 5 *^ 0 and to the corre- 
sponding homogeneous equations for a = 0. 

As discussed in Art. 2 of this chapter, the rank of the nonsquare matrix 
Q of this set of equations is at most equal to the smaller of the two 
numbers m and n.lim > », then r ^ n. Out of the ;w equations 123 it is 
possible to select r independent ones (the coefficients of these define r 
independent vectors in the vector set of G). For these equations, n — r 
independent solutions may be found by the procedure discussed for the 
case m ^ n and r ^ ». In particular, for r = n only the trivial solution 
Of' = 0 exists, which yields x - a. The solutions so found automatically 
satisfy the remaining m — r equations in the set 123 because the coeffi- 
cients of these equations are the components of those vectors in the vector 
set of G which are linear combinations of the others. 

When w < », then r ^ m. Equations 123 then yield n - r independent 
solutions which again are found as for the case m = n and r ^ m. In 
partictilar, for r = w there are « — w independent solutions. 
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8. On the rank of matrices having a null product 

If a and are two square matrices of order their product 

e = G X ffl [124] 

is also a square matrix of order n with coefficients given by the scalar 
products 

Cik = ai-bk‘ [125] 

in which fli, fl 2 > • ** is the vector set of Q and 6/, bn is the trans- 

posed vector set of ffl. 

All the vectors a,- can be orthogonal to the vectors bk^ in which case 
all the coefficients C{k are zero. It thus becomes clear that the matrix 
product Q X may be zero even though neither G nor is zero. This is 
an important difference between the laws of matrix algebra and those of 
scalar algebra, according to which the relation ab = 0 requires that 
either a or b be zero. 

The vector interj^retation of Eq. 125 allows the following conclusions 
to be drawn when C = 0: 

(a) If G is of rank then ffl must have the rank zero; that is, in this 
case = 0. 

(b) If neither G nor is zero, then the rank of both must be less than 
n. 

The first of these conclusions follows from the consideration that if the 
vector set has the rank n (occupies all the available dimensions) no 
vector exists which is simultaneously orthogonal to all the vectors a,-; 
that is, bk = 0. In order that any vectors bk may exist, the vector set a, 
must evidently have a rank less than n, and conversely the rank of the 
vector set bk must be less than n in order that any nonzero ai vectors 
may fulfill the condition u, • bk = 0. 

If the null condition 

a, • i,' = 0 [126] 

be compared with the system of homogeneous equations 

ai • X = 0 [127] 

the discussion of the previous article yields the conclusion that if the rank 
of the matrix G is fa ^ w, then « - r « independent vectors bk exist which 
fulfill the null condition, and hence the rank n of the matrix £B may be as 
large as but no larger than n — ra\ that is, -- fa. Thus if 

G X ® = 0 


[ 128 ] 
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then 

fa + n^n [ 129 ] 


in which fl and are square matrices of order n with the ranks fa and 
n respectively. 

It should be observed that the converse of this statement is not neces- 
sarily true, nor does it follow that the reversed product £63 is zero when 
3SB is zero. 

An interesting application of this result is found in considering the 
product of a matrix 3 and its adjoint 3“. From the definition of the 
adjoint matrix (see Art. 6, Ch. II) and determinant theory (see spe- 
cifically Eqs. 45 and 46 of Ch. I), it follows that 


3 X 3® = 3® X 3 


’A 

0 

0 

• 0 

0 

A 

0 • 

• 0 

_0 

0 


A 


= [130] 


in which A is the determinant of 3. Hence if the rank of 3 is (» — 1), 
i4 = 0, and according to the relations 128 and 129 the rank of 3® can 
be no greater than 1. Since the elements of 3® are the cofactors of 3, and 
at least one of these is not zero because the rank of 3 is (w — 1), the 
rank of 3® cannot be less than 1 . Hence the summary : 

If 3 has the rank n, then 3® has the rank n. 

If 3 has the rank w — 1 , then 3® has the rank 1 . 

If 3 has any rank less than w — 1, then the rank of 3® is zero. 

In connection with considerations of this sort it is useful to observe 
that if a given square matrix 3 has the rank 1 , it possesses no more than 
one independent row and one independent column of elements, and hence 
it may be represented as the product of a column and a row matrix, thus: 


Ofl 

«2 


3 = 


Xlfii • • • j9n] 


L«nJ 


[ 131 ] 


Similarly if a given square matrix has the rank 2, it possesses no more 
than two independent rows and two independent columns, and may 
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evidently be represented as 

r «ii «i2 



L«nl otraA 

The generalization of these statements is obvious. 


[132] 


9. Sylvester’s law of nullity 


From the preceding considerations it follows that if a square matrix Q 
of order n is given, and if this matrix has the rank fa and the nullity pa, 
a nonsingular matrix can be found such that the product 

e = a X a [133] 


has pa null columns. For example, if pa of the columns of a are the 
components 6 u-, 62 *, ‘ • bnk of the vectors hk (^ = 1 , 2 , • • • pa)y which 
are independent solutions to the homogeneous equations 


ax -bk^ ^0 

a2 • = 0 


[134] 


On bk^ = 0 


Pa of the columns in C are composed of zeros. The remaining fa columns 
in a may be chosen arbitrarily as long as they are independent, that is, 
as long as a turns out to be nonsingular. 

There is no nonsingular matrix a for which C has more than pa null 
columns, because (since a is nonsingular) C has the same rank as Q (see 
Art. 9, Ch. II) and hence must have fa independent columns. If, how- 
ever, pk of the remaining columns in a are then chosen to be linear 
combinations of the above pa columns, C has pa + pb null columns, and 
a is singular with nullity pb- Moreover any matrix a with nullity ph 
which produces pa + ph null columns in C must have the form of the 
particular one chosen here, for pa + pb of its columns Inust satisfy Eq. 
134, and n pboi its columns must be independent. In addition, it is 
not possible for any matrix a of rank fb to produce more than pa + pb 
null colunms in C, because any such new column requires that another 
column in a become a solution to Eq. 134, and thereby be a linear 
combination of pa of the independent columns already in a. The rank of 
a would then have to be less than tb- The nullity of C is therefore at least 
Pa + pb* But it cannot be greater than this l^cause, if it were, post- 
multiplication of Eq. 133 by a nonsingular matrix 3* could produce 
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another null column in C. This would demand that SBT be a matrix of 
rank th producing more than Pa -f ph null columns in C, which has been 
shown to be impossible. Hence no matrix SB of nullity pb can give C a 
nullity greater than pa *4- Pbi that is, 

pc^pa + pb [135] 


Of course pa + pb niay exceed w, in which case e\idently pc cannot be 
greater than n. 

Now let it be supposed that two nonsingular transformation matrices 
T 4 and T b are determined such that 

G' = T 4 G [136] 


is a matrix whose first fo rows are independent and the remaining ones 
null, and 

SB' = SBT^ [137] 


is a matrix whose first tb columns are independent and the remaining 
ones null. Then 


T 4 GSTB = G'a' = = e' [138] 


has the same rank as C, but is evidently in the form 


Cn • 


0 • 

0 " 


* ^ran 

0 • 

• 0 

0 

. . . 


0 

0 



0 _ 


[139] 


It then becomes dear that the rank of C' and hence that of C cannot 
exceed fa or r^,, whichever is the smaller. Or the nullity of C must be at 
least as great as pa or pb, whichever is the larger. 

Together with Eq. 135 this conclusion yields the summary 

g />a + Pb [140] 



or stated in terms of rank, 

Tc ^ fa + f6 - » [141] 



This useful statement regarding the rank or nullity of a product matrix 
in terms of the ranks or nullities of its components is known as Sylvester's 
law of nuUUy (or degeneracy). 
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10. Reduction of a square matrix to the diagonal form of 

ITS LATENT ROOTS; NORMAL CO-ORDINATES 


In view of the vector interpretation of the linear transformation 


4“ * * • + O'lnXn = yi 


OnlXi -f • • • + dnnXn = 


[142] 


as discussed in Art. 3 of this chapter, it is interesting to inquire whether 
there exists a vector x whose transform y has the same direction in space, 
that is, whether a solution to Eqs. 142 exists which conforms to the 
stipulation 

= \x [143] 


or more specifically 

y. = X:r. (for i = 1, 2, • • • n) [144] 


in which X is a nonzero* factor. 

Substituting the values given by Eq. 144 for the right-hand members 
of Eqs. 142, and transposing these to the left-hand sides, yields the condi- 
tions 


(an — X).X*i -f 012^2 4" • ' * + CllnXn = 0 

(i2lX\ + (<^22 - '^)X2 H + (22nXn = 0 


+ an2X2 H h {ann - X)a\ == 0 


[145] 


This set of homogeneous equations has nontrivial solutions only when the 
determinant of its matrix (called the characteristic matrix of (i) is zero, 
that is, when 


(an — X) ai2 
021 (022 ““ 





0^2 • * * ((^nn 


[146] 


This determinant is called the characteristic function of the matrix Q.. 
It is evidently a polynomial in X of the «th degree. The condition equation 
146, which is called the characteristic equation^ is an algebraic equation of 
the nth degree in X, and hence there are n values of X for which the 
matrix of Eqs. 145 has at most the rank (n — 1). These so-called latent 
roots or characteristic values may be denoted by Xi, X 2 , • • • Xn. 

In the general case for which the matrix Q of the transformation 142 is 
unsymmetrical, the latent roots may be real or complex. Also they may 

•The stipulation X * 0 leads to the inquiry regarding the existence of solutions to the 
corresponding homogeneous equations, which has been discussed previously. 
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be distinct, or there may be coincident roots. If all the latent roots are 
distinct, the rank of the characteristic determinant for any particular 
latent root cannot be less than (» — 1); for if it were, all its {n — 1)- 
rowed minors would be zero, and hence its derivative with respect to X 
would also be zero when X = X,. This fact would indicate that X* must be a 
multiple root of Eq. 146, which contradicts the assumption that the 
latent roots are distinct. 

When the roots are distinct, one may conclude that there are n distinct 
directions for the vector x which remain unchanged by the transformation 
142, that is, there are n directions for which the vector y coincides with 
the vector x. These are given by the w vectors .f, i*, • • • x w^hich are found 
to be solutions to the n sets of homogeneous equations 145 resulting from 
substituting successively the characteristic values Xi, Xo, • • • K for X. 

If the cofactors of the elements of the determinant in Eqs. 145 for the 

s 

root X = X, are denoted by the direction cosines of the vector x are, 
according to Eq. 109, given by 

^ --■.TZ.— :== = (fori = 1,2,- ••«) [147] 

+ iky + • • • + {Ly 


in which the index i is arbitrary but must, of course, be the same for the 
determination of all the direction cosines corresponding to a given 
root X,. 

The components Xi,X 2 y ■ ■ in o( the vector x may be taken equal to the 

» s t 

cofactors kn, ki 2 , • • • kin respectively {i arbitrary) or to any multiple of 
these values since the length of the vector is not determined by Eqs. 145. 
Unit length for the vector x results when its components ii, i 2 » * • * in 
are taken equal to the direction cosines fu, respectively. 

With the aid of the column matrices 


9 

^2 


■/u‘ 

* 


• 

9 

L^nJ 


1 

9 

• 

1 


[148] 


the transfomation 142 for the condition 144, appropriate to the root 
X = X(, may be expressed by the matrix equations 

Qx] - X,i] [149] 


or 


at,] = X./,] 


[ISO] 
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There are n matrix equations of this form for the n roots corresponding 
to the integer values 1, 2, • • • n for the indexes. 

These n matrix equations may be combined into a single one by de- 
fining the matrix 


11 

fl2 * 

• fin 

f 

21 

^22 * 

f2n 

nl 

1.2 • 

f nn 


[151] 


in which the first column is the column matrix 148 for 5=1, the second 
column is the column matrix 148 for 5 = 2, and so forth. The set of n 
matrbc equations 150, for 5 = 1, 2, • • « is then given by 


as. 


Xifn X2fi2 ■ * Xnfin 
Xif21 ^ 2^22 • * Xnf2n 


LXlfnl X2fn2 


Xnf n 


[152] 


or, by the introduction of the diagonal matrix 

[X, 0 0 0 

. 0 X 2 0 • • 0 


0 0 • • • X„ 


this is 


a£ =£a 


[153] 


[154] 


It may now be shown that when the X-roots are distinct the matrix S 
given by Eq. 151 is nonsingular or that the n vectors x, • • x whose 
directions remain unchanged by the transformation 142 form a linearly 
independent set. The proof is readily given by assuming that this vector 
set is linearly dependent and showing that such ar. assumption leads to an 
absurdity.* 

The assumption that the vector set i, • • • x is linearly dependent is 
equivalent to the statement that the vector relation 

+ 72 ^ + * ' * + yn^ = 0 [^ 55 ] 


can be satisfied for 7-values other than all zeros. This vector relation is 
alternatively expressed by 

i 7.i] = 0 [156] 

•The proof given here was suggested by Professor D. J. Struik of the Department of Math- 
ematics at the Massachusetts Institute of Technology. 
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Miiltipl 3 rmg Eq. 149 by y, and siunming over s gives 

i ayj] = a i: = i 7 .x^] [157] 

a«»l «*1 a-1 

and in view of Eq. 156 this yields 

i 7aX,i] = 0 [158] 

Repeating the same process but with Eq. 149 multiplied by X, in addition 
to and this time using Eq. 158 instead of Eq. 156, gives 

i ysK^x] = 0 [159] 

a =1 

In other words the assumption expressed by Eq. 155 or Eq. 156 together 
with the matrix Eq. 149 leads successively to relations of the form 

i 7.X.*i] = 0 [1*60] 

or in vector form 

7iXi*i: + 72 X 2 ^^ H h = 0 [161] 

in which the exponent k can be 0, 1, 2, • • . For k = 0, Eq. 161 is the same 
as Eq. 155. 

Since the X-roots are assumed to be distinct, this result means that 
any number of independent relations of linear dependence among the 
vectors i, i, • • 5 exist — clearly an absurdity. For exiunple, in three- 
dimensional space, the existence of one relation of linear dependence 
means that the vectors lie in a plane; the existence of two independent 
relations of linear dependence means that the vectors lie in a straight 
line; the existence of three independent relations of linear dependence 
requires that the vectors be zero. 

If two of the X-roots are coincident, it might appear that two of the 
vectors x become equal (or at most proportional), since they then repre- 
sent two solutions to the same set of homogeneous equations 146. Under 
such conditions the modal matrix £, given by Eq. 151, would become 
singular by reason of the two proportional columns within it. It must be 
recognized, however, that the solution 147 for the various vectors i = f, 
is based upon the fact that the rank of the characteristic determinant is 
exactly n — 1, and although it is surely true when the X-roots are dis- 
tinct, it may or may not be valid in the case of repeated roots. In fact it is 
shown in Art. 4, Ch. IV, that, if the matrix Q is symmetric, the occurrence 
of a repeated X-root of order p ^ n requires the rank of the characteristic 
determinant for that particular value of X to be n - p. It is then possible 



AH. W] 


REDUCTION OF A SQUARE MATRIX 


115 


by the methods discussed in Art. 7 of the present chapter to find p 
independent solutions to this single set of homogeneous equations, and 
to make these solutions orthogonal and of unit length if desired. Under 
such conditions, the present proof of the independence of the solutions 
for all the X-roots may be carried through without significant alteration, 
bearing in mind that the p solutions corresponding to the p equal X-roots 
are already chosen to be independent of one another. 

Unfortunately in the general case, when the matrix fl is not symmetric, 
there is no unique correspondence between the order of a repeated X-root 
and the rank of the characteristic determinant. The nullity of the char- 
acteristic determinant may be less (but not greater) than the order of the 
repeated root, in which case solutions for n independent vectors x cannot 
be found. For example, the unsymmetrical matrix 

‘1 1 6 ] 

Oil 

_0 0 ij 

has a repeated latent root X = 1 of order 3, whereas the nullity of the 
corresponding characteristic determinant is only 1. Since the rank of the 
latter determinant is 3 — 1 = 2, it is possible to find only one independent 
vector X, and therefore any attempt to form a modal matrix yields a 
singular result. Hence the existence of a nonsingular modal matrix £ is 
guaranteed when and only when the matrix Q is either symmetric, or 
has n distinct latent roots. 

In this case Eq. 154 may be premultiplied by £“^ to give 

£-'(!£ = A [162] 

The matrix Q is thus reduced to the diagonal form of its latent roots. 

The significance of this reduction, as far as the transformation 142 is 
concerned is recognized when the latter is written in matrix form, 


ax] = v] 

[163] 

and the changes of variable expressed by 


jc] = £a:'] and y] = 

[164] 

are considered. Then, in view of Eq. 162, the transformation 163 becomes 

A*']=/] 

[165] 

which represents the equations 


Xi*'i = y'\ 


X2*'2 = y'i 

[166] 

n ^ y n 
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It thus becomes dear that the change of variable expressed by Eq. 164, 
which amounts to transforming from the co-ordinates x and y to new 
co-ordinates jc' and y', reduces the set of simultaneous equations 142 to a 
set of n distitict equations, the solutions of which are immediately set 
down. Another way of looking at this situation is to say that in the new 
co-ordinate system to which x' and y' refer, the variables x'l • • • x'n are 
no longer dependent upon each other as are the variables • • • Xn in 
the original co-ordinate system to which x and y refer. 

In connection with physical problems, the new co-ordinates in which 
the equilibrium of the system appears to be expressed in terms of separate 
equations are called the normal co-ordinates of the system. The trans- 
formation to normal co-ordinates, which is efTected by the matrix £, 
apparently isolates the various modes of behaWor of the physical system, 
and for this reason the matrix £ is spoken of as the modal matrix. 

In view of the fact that the vector x, which constitutes a solution to 
Eqs. 146, is determined in direction only, it may be noted that the matrix 
Q is still reduced to the diagonal form of its latent roots if in the relation 
162 the modal matrix £ has its columns multiplied by arbitrary nonzero 
factors. In other words, the matrix £ in Eq. 162 may be replaced by the 
product £12) in which 3) is an arbitrary nonsingular diagonal matrix. 
Since the inverse of £2) is 3)”^£“"^ and = A, the truth of this 

statement is evident. 

The type of transformation 162 to which the matrix Q is subjected 
evidently corresponds to identical transformations for the variables, as 
shown by Eqs. 164. Thus a transformation of the matrix Q of the form 

e~»ae = £B [i 67 ] 

corresponds to a transformation of the variables as expressed by 

x] = Cx'] and y] == Cy'j [168] 

in which C is necessarily nonsingular. 

The geometrical significance of these relations can be studied through, 
first of all, recognizing that the transformation 142 may be looked upon 
as determining a point yi • • • yn lying on a plane through the origin of 
co-ordinates defined by the equation 

0iyi + p2y2 + • * • + Pnyn = 0 [169] 

corresponding to a given point Xi • • • Xn on a similar plane defined by the 
equation 

aiXi + a2X2 + • • • + Oln^n = 0 [ 170 ] 

It may be helpful in this connection to consider the coefficients ai • • * an 
to be the components of a vector a. Then Eq. 170 states that a • x = 0, 
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which requires that the vector x be orthogonal to the vector a, and hence 
that the point Xi • • • iCn lie in a plane normal to the direction of a. Equa- 
tion 169 may similarly be interpreted, and it follows from the solution 
of simultaneous equations that the components of the corresponding 
vector 0 are given by the relation 

0 k = k • a (for ^ = 1, 2, • • • w) [171] 

in which a*i • • • a*n is the vector set of the matrix Q* reciprocal to (?. 
According to Eq. 74, the coefficients 0k are thus seen to be the contra- 
variant components of a with respect to the oblique co-ordinate system 
defined by the vector set of fl. 

If the matrix Q is nonsingular, the transformation 142 yields a one-to- 
one correspondence between points on the two planes defined by Eqs. 
169 and 170. One may say that Eqs. 142 transform planes into planes 
and hence also straight lines into straight lines.* For this reason a linear 
transformation is also spoken of as a collimaiion. 

If the co-ordinates Xi - Xn and yi * • • yn define the points P and Q 
respectively, the collineation 142 relates any chosen points Pu P 29 • * * 
(not necessarily on the same straight line) to corresponding points 
Q 17 Q 2 f ' ' * • The foregoing discussion in this article shows that a non- 
singular collineation possesses n distinct points Pi • • Pn (called the 
fixed points) to which the corresponding points • • • Qn are so related 
that the directions OPk and OQk (O denotes the origin of the co-ordinates) 
are coincident, and OQk OPk = This so-called descriptive or projective 
property is evidently placed in evidence by the collineation which has 
the diagonal matrix A. The collineations with the matrices Q and A have 
the same projective properties, and are therefore spoken of as being 
equivalent. 

More generally, any two collineations with the matrices Q and SB related 
by Eq. 167 have the same projective property's. Equation 167, which 
expresses the equivalence of the two collineations, is called a collineatory 
iransfonnatum of the matrix Q. 

The characteristic matrices of 3 and SB are related by the same trans- 
formation. With the aid of the unit matrix ^ of like order, the character- 
istic matrices of 3 and SB are expressed by 

(3 - X<ni) and (SB - X^) [172] 

•The truth of the latter part of this statement may be seen from the fact that the trans- 
formation 142 relates points on a given continuous curve to points on another continuous 
curve. If a plane ^1 is transformed into a plane ^'i, and a second plane pi (not parallel 
to ^i) into the plane p'21 points on the linear intersection of pi and p 2 must correspond to 
points on the linear intersection of p\ and p\\ for these points arc common to the planes 

and Pi and, because of the continuity of the transformation, they must also be common 
to the planes p' i and P^ 2 . 
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But by Eq. 167 

(t‘\a - xGU)e = e-^ee - xe-'^ue = a - x^u [173] 

The characteristic function of iB (determinant of the characteristic 
matrix) is therefore 

|£B = x^ul = |e“>(G - x%)e| 

= cr^a - x^jc = ;a - x^ij [ 174 ] 

in which the bars enclosing the matrix expressions indicate that the 
determinant of the enclosed matrix is meant, and C is the determinant 
of the matrix C. 

The important conclusion to be drawn from this last relation is that 
the two matrices Q and fB, which are connected by the transformation 
167, have the same characteristic equation and hence have the same 
latent roots. In other words, the latent roots of a matrix are invariant to a 
coUineatory transformation. 

In this connection it is useful to note that Eqs. 162 and 167 yield 

= A [175] 

from which it is clear that the modal matrix for is given by the product 


11. The Cayley-Hamiltox theorem 


An interesting relationship follows from Eq. 162, namely 
(J X G = (?- = 


in which 



rv 

0 0 • 

• • 0 I 

A* = A -X A = 

0 

0 • 

•• 0 


. 0 

0 ••• 



[176] 

[177] 


This relationship may readily be generalized with the result that the wth 
power of a nonsingular square matrix G is given by 


with 


Qm 

= £a 



X," 

0 

0 • • 

• 0 ■ 

0 

X 2 ” 

0 • • 

0 

0 

0 


X 


[178] 

[179] 
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If P(X) is a polynomial in X, that is, 

P(X) = flpX** + ap_iX^ * + ••• + OiX + Oo 
the result expressed by Eq. 178 shows that* 

P(G) - apfl" + + . . . + fljG + 

= £P(A)£-> 


According to Eq. 179, however, 
P(A) = 


P(X.) 

0 


0 


0 0 
P(X2) 0 


0 


• 0 
• 0 


P(X„)J 


[180] 


[181] 


[182] 


and hence a polynomial in terms of a nonsingular matrix (? becomes 


P((3) = £-' X 


P(X,) 

0 


0 

PfXi) 


0 

0 


0 


0 


P(X„)J 


X £ [183] 


This result becomes particularly interesting when the polynomial P(X) 
is chosen to be the characteristic function of the matrix fl, that is, 

P(X) = jG - X6U| [184] 

Then P(X) = 0 is the characteristic equation and hence P(\i) = 
P{\2) = • • • = P{\n) = 0. The right-hand side of Eq. 183 is then zero, 
and the left-hand side is the characteristic function of fl with the matrix 
a ijsclf substituted in place of the variable X. The conclusion follows that 
the matrix (i satisfies its awn characteristic equation. 

The above derivation requires that the modal matrix £ be nonsingular, 
which means that the matrix d is either syminetric or has n distinct 
latent roots. In spite of these restrictions on the method of derivation 
employed here, it can be shown that the last result, which is known as 
the Cayley-Hamilton theorem, is true in general for any square matrix.f 
A numerical illustration is given by the following: 



♦It should be observed that the constant term in the polynomial P(Ct) is ooCl®, and that 
the zero power of a matrix is equal to the unit matrix of like order. 

tSee, for example, L. E. Dickson, Modem Algebraic Theories (New York: Benj. H. Sanborn 
and Co., 1926), p. 48. 



120 


LINEAR TRANSFORMATIONS 


[Ch. Ill 


The characteristic equation is given by 


which yields 


or in factored form 


(2~X) 

1 1 (4 - X) 


X® - 6X=* + IIX - 6 = 0 


(X - 1)(X - 2)(X - 3) = 0 


Hence the Caylcy-Hamilton theorem in this case states that 

ff * - C,(i- + 1 Iff - 6% = 0 [188] 

or in factored form that 

tff - “^Dlff - 2<4l)(ff - 3^) = 0 [189] 

Substituting the matrix ff from Eq. 185 into the last equation yields the 
identity 


1 -1 X -1 


-2 -1 -3 


0 -1 X 

1 2 


-3 -1 -3 

-1 -1 -1 

1 1 1 


0 [190] 


which the reader may readily verify for himself. 

It is worth noting from Eq. 188 that higher pcrwers of the matrix ff are 
expressible in terms of powers up to and including the (h — l)th. Thus, 
for example, 

ff® = bff® - llff + 

ff« = 6ff®-llff®+ 6ff = 250® - 60ff+ 366U [191] 

ff® = 25ff® - 60ff® + 360 = 900® - 2390 + 150% 
and so forth. 

In a similar manner, negative powers of ff may be calculated. Thus 
from Eq. 188 in the above exeunple 

6ff-* = ff® - 60 + 11% 

6ff-® = ff - 6% + llff-> = V®* - lOff + ^% [192] 

and so forth. In particular, the inverse of ff by this method is found to be 


ff-* = iff® - ff + = i 


9 1 7 

3 3 3 

-3 -1 -1 


[193] 
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12 . SYMMETRICAL TRANSFORMATION 


Returning to the discussion of Art. 10, note that when the matrix of 
the transformation 142 is symmetrical, the vectors x whose directions are 
left unchanged by the transformation form a mutually orthogonal set. 

The truth of this statement may be seen by considering Eq. 149 for 
two particular latent roots X, and X*, thus 

Qx] = Kx] [194] 

Gi*] = \kx] [195] 

The transpose of the matrices in Eq. 194, postmultiplied by x] reads 

xW] = KilJ] [196] 

and Eq. 195 premultiplied by the transpose of x] is 

x]fi^] = X,.r].^] [197] 


Now if Q is symmetrical, it is equal to its transpose G^, so that Eqs. 196 
and 197 then yield 

KxU] = X,i],^] [198] 

or 

(X, ~ X,)i-]4*] = 0 [199] 

Written out, this reads 




Jt 

Xo 


— iX, — X^.)(.X'i.X’i -hT2X2-f- • • * 'hXnXn) = 0 

[ 200 ] 


which is alternatively expressed by the scalar product 

(X. - Xit)i 1 = 0 


[ 201 ] 


Since this result states that the scalar product of two vectors corre- 
sponding to two different latent roots is zero, the vectors must be normal 
to each other. In the case of repeated X-roots, it is pointed out in Art. 10 
of this chapter that the corresponding x and x can always be chosen to be 
orthogonal. Thus the apparent failure of the conclusion drawn from 
Eq. 201 in this instance is not significant. 

In view of Eq. 201 it is also easy to show that the latent roots of a 
symmetrical matrix must be real. For if they were complex they would 
have to appear in pairs of conjugates,* for example, 

X* = a + jP and X* = a - 

‘Since the matrix is assumed to have real coefficients. 


[ 202 ] 
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and the vectors i and x satisfying Eqs. 194 and 195 would likewise 
have to be conjugate complex, thus 

= « + jv and & = u [203] 

Their scalar product is given by 

X ' i — u • u + V • V j{u ‘ V — V • u) [204] 

which is a real nonzero value. Hence Eq. 201 would demand that X, = \k 
or 

a -hjfi = « [205] 

whence 

/3 = 0 [206] 

In other words, the latent roots must be real. 

When the vectors .r form a mutually orthogonal set, the modal matrix 
given by Eq. 151 becomes orthogonal, because its elements are then the 
direction cosines of a set of mutually orthogonal axes. Hence when the 
matrix Q is symmetrical, its modal matrix has the property expressed by 

[207] 

and the transformation of Q to the diagonal form of its latent roots reads 

= A [208] 

In this case, the normal co-ordinates of the physical system whose 
equilibrium is expressed by Eqs. 142 are orthogonal to each other. 

Since the matrices encountered in practical problems are predomi- 
nantly symmetrical, the results of the present article are quite significant. 
These matters are elaborated upon in Ch. IV. 

PROBLEMS 


1. Determine the rank of each of the following vector sets: 


(a) j 

'( 5,3, -1) 
(10,0, 2) 
i(0,l, 0) 

(b) i 

1 

f( 3,4, -1) 
(-1.5,2, -0.5) 

l( 6,8, -2) 

(c) 

f(l,2, 4) 

U1.3, 9) 

1(1,4, 16) 

(d) ' 

f( 3a, 66) 
{(-Ua, 186) 

(e) < 

1(2, 2,2,2) 

1(3, 3, 3, 3) 

1(4, 4, 4, 4) 

(f) 

((1,1,1) 

I (1, e>(4»/3)) 

[(l^e->(2T/» -/(4./8)) 

(g) ^ 

(1, 2^ 2», • • 
(1, 3*, 3», • • 
(1,4*, 4’,-- 

■ 2""‘) 

• 3"-‘) 

• 4"“*) 





(l,n*,w*, 

• n"“*) 
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2. A given vector set di, 02 , • • • in an »>dimensionaI space (with tn < n) \s 
linearly dependent as expressed by 

Ofidi 4- 017^2 H + OtmOm = 0 

Show that all determinants of order w, formed from this vector set, vanish identically. 

What can you say about the rank of the vector set if aU the a’s have finite nonzero 
values? If all but one of the a’s are zero? 

Which of the following vector sets is linearly dependent? What is the rank of each? 

[( 6a, -2^-4^, 2) [(6,7,9,11,13) 

'i( a, h, c, d) \(2,4,6, 8,10) 

(,(-3a, ft, 2c, -1) 

3. Consider the vector set a i, a2, ••• a,n in an «-dimensional space with m = n -h 1, 
and suppose that the first n vectors ai, * • • a„ are linearly independent. Show that one 
may write 


Om = OfllOl -{-Oli2fl2 + * • • d-OrinOn 

and express the coefficients a u in terms of the determinant of the vector set ai, ■ • an, 
its cofactors and the components of the wth vector. 

Generalize this result for the case m = n -h p with p > I, and illustrate with the fol- 
lowing numerical example: 



-1, 

-2, 

-3) 

( 

2, 

4, 

5) 

( 

1, 

1, 

1) 

( 

1, 

3, 

5) 

l(- 

-1, 

0. 

1) 


4. Consider three vectors in three-dimensional space. The vector oi has a length of 
eight units and is directed at an angle of 45® relative to the co-ordinate axis 3, while 
its projection upon the 1,2-plane makes angles of 45® with both axes 1 and 2. The 
vector a 2 lies in the 1 ,2-plane, having a length of sLx units, and oriented at angles of 
30® and 60® respectively relative to the axes 1 and 2. The vector az has a length of 
four units and coincides with axis 3. Write down the matrix corresponding to this 
vector set. Express each of these vectors linearly in terms of the three unit vectors 
coinciding with the co-ordinate axes. 

5, The vector set ai, a 2 , • • • an in an ^-dimensional space has the rank — 1. 

Show that 

^ UikAnk =0 for I = I, 2, • • • n 
k^i 

and interpret the significance of these equations geometrically. Suppose that this 
vector set is so chosen that the components a^n have finite nonzero values. A set of n 
vectors fti, ft 2 , * * * ftn in an w-dimensional space is now defined with the components 
bik * <iik/ain* Obtain for this vector set the relations 

ofiftti "f” 4" * * * "1“ ocmbim ” 1 for i ~ 1, 2, • • • ft 

giving the appropriate expressions for the coefficients ak, and show that all these 
vectors terminate in the same plane. What is the equation of this plane? 
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Illustrate this problem forn == 4 with the vector set 


( 

1. 

1,0. 

1) 

( 

2, 

3.1, 

1) 

( 

4, 

0, 2, 

1) 

(- 

-2, 

-2,0, 

-2) 


6. Consider again the vector set of the previous problem and show that 

Z) =0 for /r = 1, 2, • • • « 

1=1 

Assuming that the vectors are so numbered that -1„„ 9^ 0, deduce the result 

— dn = «l<Il +Of2U2 + * • • -}- OTn-l^^n-l 


giving the values of the coefficients qta. Determine this relation of linear dependence 
for the numerical vector sets: 


( 

I, 2, 

1,0) 

(4,2.5, 

11 

( 

3, 

0, 

4,1) 

(3,0,4, 

1) 

( 

1 

0. 

0,1) 

|(^,o,o, 

-1) 

(■ 

-2, -4, 

-2,0) 1 

[(0,3,1, 

1) 


7. Suppose the vector set <Ji • • • of a matrix G in an //-dimensional space has 
the rank r=« — ^(1 < p < n), and assume the numbering of the vectors and com- 
ponents to be so chosen that the rth order minor in the up|)er left-hand comer of the 
determinant A is nonzero. Obtain linear relations between each of the p dependent 
vectors and r independent ones by the following procedure. 

From the original matrix, form submatrices by striking out all but the first r 
columns and aU but one of the last p rows. Call this row[thc (> -f i )th. There are p such 
submatrices as i goes from i = 1 to i == p, and each submatrix then consists of r 
columns and r -f 1 rows. A group of r 4- 1 minor determinants of order r is now 
obtained from each submatrix by deleting therefrom the ^th row, with /fe * 1, 2, 
• • • r -f- 1. Each of the minor determinants so generated is prefixed by a sign-con- 
trolling factor ( — to yield a “signed minor,’’ denoted by a**. Note that 

ai.r+l = Of2.r+l = • • • = Ofp.r+l ^ 0. 

Show next that the following systems of relations hold 


r-l-l 

Otxkdkq = 0 

k^l 


a -\.2 r-p) 

(q - 1,2, •••/O 


or that one may write the desired set of relations of linear dependence among the 
vectors: 

Ofllfll 4- 011202 4- * • * 4- Oti.r+lOr+l = 0 
Ol2lOl + 012202 4* • • • 4* Ot2,r+lOr+2 = 0 


OlplOi 4- Olp2Q2 4- • • * 4" «p.r+l<»i4-p * 0 

Illustrate with the numerical vector set of rank 2; 

[( 2 , 1 , 6 , 8 ) 

J (1,1, 3, 5) 

(0,4, 0,8) 

1 ( 0 , 1 , 0 , 2 ) 

8. Consider two vector sets ai • • • a» and • • • 6» with the square matrices Gt 
and 
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(a) If the vectors in the two sets are connected by the relations 

n 

£ CihfJik = hi for t = 1, 2, • • • w 
A; “1 

in which the matrix C is nonsingular, show that the components are connected by the 
matrix equation 

c X a = a 

and that the components of the corresponding transposed vector sets are related by 

Q,xC, = % 


(b) Regard ui • • • <i„ as a given vector set and as obtained from ai • * • On 

through linear transformation with the nonsingular matrix (?. Show that the rank of 
the vector set is invariant to this transformation but that such would not be the case 
if (? were singular. 

9. Two vector sets Ui • • •</„ and />! • • • with the square nonsingular matrices 
(2 and have the matrix products 

(jffi - and m = a 

(a) If the set ui • • • an is transformed by a nonsingular matrix C as indicated by 

ae ^ a 

how must the set • • • 6n be transformed in order that ^? = J?? 

(b) Derive the corresponding relation between S and S, and show that the 
determinants of these matrices are equal. 

10. Let Pi and Pj be two points in an «-dimensional orthogonal Cartesian co- 
ordinate system. The vectors xi and x% with components xn • • • xi„ and X21 • • • x^n 
respectively emanate from the origin of co-ordinates and terminate upon the points 
Pi and Pz. 

Express the direction cosines cosai, cosao, • • • cosa„ of a vector drawn from P2 
to Pi in terms of the components of xi and X2 and the distance / between Pi and P2, 
and prove the identity 

• cos* ai -f cos* a2 -h • • • -f cos* an = 1 


11. With respect to an oblique co-ordinate system, t' .e contravariant components 
of a vector x are denoted by {i, {2, • * • and its covenant components by f*i, f*2, 
• • • The corresponding components of another vector y are 7?i, 7?2, * * • Vn and 
V*h * * * V*n> If the contravariant components of the two vectors are related by the 
matrix equation 

[{i • • • X (? = [>;i • • • IJn] 

and the covariant components by a similar equation with the matrix £B, how is ffi 
related to (2? 

12. In an oblique co-ordinate system two vectors xi and have components which 
are conveniently expressed as the elements of the following row matrices: 

[f 11 {12 • • • £u ] [£21 £22 • • • £2n] (contravariant) 

[£*n £*12 • * • £*in] [£*21 £*22 • • • £*2n] (covarfant) 

The lengths of these vectors are h and 1%, 0 is the angle between them, and d is the 
distance between their tips. 
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(a) Prove that: 

- Ku - iu] X [^u- - i*u]i [1] 

and 

cos^ - [Xu] X = [\2k] X [X*u], [2] 

in which X,* = and X*u = 

(b) Prove the invariance of these expressions under any linear transformation of 
co-ordinates. 

(c) Show that Eq. 1 is equivalent to 

= /j 2 ^ 1^2 _ 21 , i, cos e 

(d) For any vector .r with contravariant and covariant components f* and 
show that 

[X;.] X [\\]t ^ 1 

Obser\'e that Xjt = {a- / or X*a = / cannot be rcprcsenlwl ;is the cosine of a real 

angle but that - 1 ^ XaX*a ^ 1- (The Xt arc called direction parameters and the 
\*k the moments of the vector. ) 

(e) In a given n-dimensional coordinate system let X,t and X*,a be the parameters 
and moments of a set of vectors whose components are regarded as the elements of 
row matrices. If a new coordinate system is introduced through a transformation with 
the nonsingular matrix [c.a], the parameters and moments arc carried over into the 
values \sk and X*«a. Show that the following expressions apply; 

[X.*] = [X.] X [fu] and [X*.i.] = [X*..] X 

13. The rows in the following matrix represent components of a given vector set in 
an orthogonal Cartesian coordinate system: 



(a) Compute the magnitudes of these vectors, their direction cosines, and the cosine 
of the angle between each pair. 

(b) Find the inverse matrix and, for the inverse vector set which it represents, 
compute the corresponding quantities to those in part (a). 

(c) Make a 120° isometric plot showing all vectors. 

14. Make computations as in parts (a) and (b) of Prob. 13, considering the matrix 

“1 4 1 3“ 

0 - 1 3 - 1 

3 10 2 

J -2 5 1_ 

Repeat the computations, using the transposed matrix. 

15. (a) Consider the vector set of Prob. 13 to define the directions of an oblique 
co-ordinate system. With reference to the original orthogonal Cartesian system, the 
new vector [1,3, — 1] is given. For this vector compute the contravariant and covari- 
ant components, as well as its direction parameters and moments (see Prob. 12). 

(b) Make an isometric plot showing the position of the vector and its above< 

mentioned components, 

(c) Repeat part (a) with reference to the vector set of Prob. 14. 
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16. Let the direction parameters Xu and Mkiot k = 1, 2, • • « be regarded as the 
contra variant components of two vectors and (sometimes called “ versors ”). 
If 0 is the angle between these versors, show that 

n 

sin'd = 2 


“ ^ gkt\2kMt^ — j 


in which j?,jt are the components of the fundamental metric tensor. 

1 7. The matrix of the fundamental metric tensor in a given space of four dimensions 
is 

“ 2111 “ 

G _ 1 4 2 1 

^“1231 
I I 1 1 


Compute: 

(a) The angles between the oblique axes. 

(b) The corresponding metric coellicients (reciprocals of the lengths of the unit 
vectors (Ji • • • (J 4 of the oblique system). 

(c) The length of a vector whose contravariant components are (5, 3, 2, 1), and 
its direction parameters. 

(d) Compute the matrix 9*. 

(e) The angles between the oblique axes dehned by 9*. 

(f) The corresponding metric coedkients. 

(g) The covariant components and moments of the vector given in part (c). 

18. Let 9 be the fundamental tensor corresponding to a system of oblique axes 
and let {] be a column matrix whose elements are the contravariant components of a 
vector. Because of a co-ordinate transformation with the nonsingular matrix C, the 
contravariant components of the given vector become the elements in the column 
matrix {] = C X {]. Denoting by 9 the matrix of the fundamental tensor with 
respect to the new co-ordinate axes, what are the expressions for 9 and 9* in terms 
of 9 and 9*.^ 

Taking for 9 the matrix in Prob. 17, and (5, 3, 2, O for the contravariant com- 
{X)nents of the given vector, compute the correspond; ig components of the vector 
with respect to the new coordinates if the transformation matrix is 


e « 


1 

0 

3 

I 


4 1 
-1 3 
1 0 
-2 5 


3 


-1 

2 

1 


Compute the angles between the new co-ordinate axes. 

19. Consider an oblique co-ordinate system and a point P such that all the contra- 
variant comp>onents of a vector drawn from the origin to P are p)Ositive. The portions 
of the oblique axes coinciding with these vector components are regarded as the 
coterminous edges of a parallelepiped. 

(a) Show^ that the volume of this parallelepiped is given by the formula 

V = VGflisij 
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in which G is the determinant of the matrix 9 and Ji, { 2 , fa are the contravariant 
components of the vector. 

(b) Write down the expressions for the areas of the faces of this parallelepiped 
which lie in the 1,2-plane, the 2,3-plane, and in the 3,1-plane respectivdy. 

(c) Recognizing that the extension of the above formula to more dimensions reads 

V = vc;fif2 -f. 

compute the volume of the hyper-parallelepiped in four-dimensional space defined by 
the matrix 9 of Prob. 17 and a vector with the contravariant components [1, 2, 3, 1]. 
Compute the area of the parallelogram for each pair of co-ordinates. 

20. Let Jfi. .T 2 , • * * Xn be the contravariant co-ordinates of a point P in »-dimen- 
sional space with respect to a set of oblique axes. This system is regarded as immersed 
in an w-dimensional space with m > «. In the m-dimensional space the co-ordinates 
of the same point P with respect to an orthogonal set of axes are yi, y 2 , * * * ym. The 
orthogonal and the oblique axes have coincident origins. 

Show that the y* s are expressible in terms of the .ijt’s in the form 

n 

E = Vi (: = 1, 2, • • • w) 

but that not more than n of the y^’s are independent. More specifically if the first n of 
the above equations are independent show that Va+i, yn+ 2 , • • • may be expressed 
in terms of yi • • • >•„ by means of the relations 

I n n 

— 2 £ a.i.-ljiVy = y. (I = « + 1, n + 2, • • • m) 

-I j-i 


in which A is the determinant corresponding to the first n rows of the matrix fowm] and 
A jk are the appropriate cofactors. _ 

Show further that the squared line element ds- is expressed b\' 


in which 


ds^ = Z (dy,)- = L L ^dx,dx, 
t - 1 t . I j - 1 dxk dxj 


= Z ikjdxk dxj 

k,j^\ 




_ ^ dyi dyi 
dxk dxj 


are the components of the fundamental metric tensor in the n-dimensional space. 
Compute the values of gkj in terms of the Ur# of the transformation. 

2l. Let x] and a] be column matrices and regard their elements xi • • • x„ and 
ofi • • • ttn as components of a variable vector x and a fixed vector a with respect to an 
orthogonal co-ordinate system. 

(a) Show that the equation 

a]tx] ^ d (a scalar) 


represents the equation of a plane, that the vector a is normal to this plane, and that 
its distance from the origin is / « d/ (magnitude of a). 

(b) In transforming to a new system of oblique co-ordinate axes, show that the 
distance of I and the point defined by a will not change if the vector x is subjected to 
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the transformation 

a*x] = f] 

whereas the vector a transforms as 

Qa] = a*] 

22. Let {] and a] be column matrices whose elements are the contravariant com- 
ponents of a variable vector and a fixed vector respectively with reference to an 
oblique set of axes in three-dimensional space. Show that the plane defined by 

«]< X f] = d (a scalar) 

cuts the co-ordinate axes at points whose distances to the origin are given by 


sj 




£ 


and that the cosines of the angles that the plane makes with the co-ordinate axes are 

cos^y = 

in which \a\ denotes the length of the vector a. 

23. I^t the equations of two planes be given by 

ct]t X {] = di and X fl == d 2 

Show that they intersect at an angle $ for which 

and compute this angle for the planes corresponding to vectors a and 0 with com- 
ponents 3, -1,4 and 2, 8, 6 with 



!“1 


24. Suppose that a space of three dimensions is imm ersed in a space of four dimen- 
sions. Let y be a vector whose components when refe red to a set of orthogonal axes 
in the four-dimensional space are yi, yj, ys, y 4 . The point determined by this vector 
lies also in the three-dimensional space and is there characterized by the co-ordinate 
Xu Xu Xz. 

Denoting ree constant vectors in the four-dimensional space by a, 7 , show that 
the equation 


yi y% yi yA 

ai at as ^4 

Pi Pi Pi Pa 

7i 72 73 74 


= d (a scalar) 


represents a plane in the three-dimensional space and find the equation of this plane 
in terms of the x-co-ordinates, recognizing that one may write 
3 

L OikXk * yi for i = 1, 2, 3, 4 

it»l 
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25. With respect to an orthogonal set of axes in three-dimensional space, three 
tx>ints Pi, Ps, Ps determined by the linearly independent vectors o;, /3, X lie in a plane. 

(a) Write the expression determining a variable vector x which terminates in this 
plane. 

(b) Determine a unit vector perpendicular to the plane. 

(c) If the points Pi, Pj, P3 are transformed into Pi, P2, P3 through a nonsingular 
transformation with the matrix (J such that the vectors become 

a] = Qiof] 

7] = '3,7] 

show that the transformed vector terminates in the new plane. 

(d) Let the components of a, 7 be given by (3, 2, 1 ). (4, 1,5), (1 , 1, 1), and find 
the transformed plane corresponding to 



26. Let P be a variable point on the surface of a hypersphere of radius p and with 
its center at Po- P and Po are determined resi>ectively by the variable vector x and 
fixed vector a. The co-ordinate system is orthogonal and the equation of the hyper- 
sphere is given by 

[xk - Of*]/ X [jf* - a*] = P‘ 

in which xi • • • x„ and ai ••• an are regarded as elements of column matrices. 

If the points P and Po are transformed into P and Po through a nonsingular trans- 
formation with the matrix Q, compute the distance between the two new points P 
and Po and show that the new point P does not in general lie upon a new hypersphere. 
Illustrate forn = 2 with ai * a2 = 1 and for the matrix 

^’[3 2] 

find the locus of the point P as the point P traverses its circle. 

27. (a) If a matrix SB is obtained through a collineatory transformation of the 
matrix Q, show that the matrices S" and S”* are obtained through the same coUin- 
eatory transformation of the matrices G" and Q"*. 

(b) If matrices St and S are obtained through the same collineator>' transformation 

of matrices fl and £D respectively, show that the matrices (SB -f- 6), SBg, g£B, 
(36)”^ and are obtained through the same collineator>' transformation of 

(Q + 3)), Q9, a)Q, and (SDG)"** respectively. 

(c) If S is obtained through a collineatory transformation of G, how is S, related 
toG,? 

(d) Make a collineatory transformation of the matrix 


' 3 

2 

r 

r~i 1 r 

-4 

1 

1 

with the matrix C * 1 0—2 

-1 

~2 

5 



(e) Let X and a respectively be a variable and a fixed vector, and assume that 
a\t X *] * 0. If y] * C^^GCx], in which G and C are nonsingular, determine the 
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vector 0 satisfying the relation /3]iy] ~ 0 and illustrate for « = 3 with a]t = [3, 2, 1] 
and matrices fl and C as given in part (d). 

28. Find the latent roots of the following matrices 


-13 241 fl 2 4l ri91 -257 

-11 21 13 9 114 -152 

-6 12 J Ll 4 16j L 85 -115 



29. Find the latent roots of the matrix 




2 

-2 

3 


1 

1 

1 



Form the mwial matrix £ and check numerically the relation £ ^(?£ = A. Make an 
isometric plot showing the directions of the principal axes. 

30. Prove that the determinant and the rank of a matrix remain invariant under a 
collineatory transformation. 

31. Show that the roots of the characteristic function are invariant to elementary 
transformations of the characteristic matrix. Through an appropriate succession of 
elementary transformations of the characteristic matrix, show that the latent roots 
of the matri.x 


“5 0-1 0“ 

0 5 0 1 

9 15 0 

0 9 0 5 


arc Xi = Xj = 5 — j3 and Xj = X 4 = 5 jS, 



CHAPTER IV 


Quadratic Forms 

1. The quadratic form associated with a linear transforma- 
tion 

If in the linear transformation 

<*11-^1 -f + * • * + = V| 

^2 1-^*1 ”1“ <*22-'*2 “h • * * + (l2n^‘n = y * f 1 1 


/1 1*^1 “1“ ^^n2*^*2 “1“ * * * ”1“ ^nn^n 

the equations are multiplied respectively by JCi, .V 2 , • • • Xn, the sum of the 
resulting left-hand members is a rational entire function which is homo- 
geneous and quadratic in the variables .Vi • • • .v„. This function is called a 
quadratic form. More specifically it is referred to as the quadratic form 
associated with the linear transformation 1. 

Written out, this function has the appearance; 

F == + *^12X2X1 + * • + OinXnXi 

+ ^21X1X2 -f 022X2^ “h • • • + a2nXnX2 r^l 


-f (ZniOriAn -h an2X2Xn + * * * + annXf^ 

The terms on the principal diagonal of this square array involve the 
squares of the variables JCi • • • jc^; the remaining terms involve all the 
possible cross-products. 

The matrix fl of the transformation 1 is called the matrix of the 
quadratic form, and its determinant A is referred to as the discriminant 
of the quadratic form associated with this transformation. This chapter 
is concerned with real quadratic forms, that is, such forms which have 
matrices with real elements. 

The writing of this function can be abbreviated through the use of a 
double summation, thus 

n 

P = "L Oii^iXk [3] 

Here the terms for f = 1 and k from 1 to » are those in the first row of 
Eq. 2; the terms for i = 2 and k from 1 to n are those in the second row of 
Eq. 2, and so forth. 

Since the terms with a,* and aki for like values of i and k{kj^i) involve 
the product of the same pair of variables x, and Xk (for example, the terms 

132 
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ai 2 ^\X 2 and a 2 iX 2 Xi), the quadratic form is evidently no less general if the 
matrix fl of the associated linear transformation is assumed to be sym- 
metrical; that is, if 

CLik = (iki or Qt = Gi [4] 

Hence in the discussion of quadratic forms it is possible to assume that 
the associated linear transformation is a symmetrical one, and thus to 
make available to the discussion the sp>ecial properties of such transforma- 
tions pointed out in the last article of the preceding chapter. 

A quadratic form may also be thought of as obtained through the 
squaring of a linear form; thus 

F = {aiXi -h Ot2X2 + • • • + OtnXn)^ [5] 

although the result is less general because it contains only n arbitrary 
coefficients.* Its formal appearance is, however, the same, as may be 
seen if all the terms in the full square represented by Eq. 5 are system- 
atically written down; thus 

F = + aia2^i^2 + * * + aianXiXn 

-f- + CC2^X2^ . . . -^ OC2anX2Xn r,-. 


-h a„a|A*„.Vi -|- ana2XnX2 -f • • • -f an“Xn^ 


Identification of Eqs. 2 and 6 leads to the relation 

Qik = aiotk [7] 


The function F as given by Eq. 5 is more convenient for the derivation 
of the following useful relations. Thus it is readily seen that the partial 
derivative of F with respect to one of the variables jr, is given by 

dF 

— = 2a,(aiXi -h a2A*2 + • • • + «n^n) [8] 

dXi 


By substitution of Eq. 7, the corresponding result for the function F of 
Eq. 2 readsf 


1 ^ 

2 dXi 


^ilXi + <^42X2 + * * * + 


[9] 


This result allows the quadratic form to be expressed in terms of its par- 

•The rank of the quadratic form (which is the rank of its matrix) is in this case 1, as may 
be seen from the fact that the coefficients given by Eq. 7 are formed in the same fashion as 
are those of the matrix in Eq. 131, Ch. III. For the present discussion the rank of the quad- 
ratic form is immaterial. 

fThis result may, of course, also be obtained directly from Eq. 2, utilizing the symmetry 
condition 4. 
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tial derivatives thus, 

„ 1 I dF dF 

1 T *+■ 7 — • -^2 + • 
2 [ 6a*i Bx2 

or in the more compact notation as 

F = zi:—Xi 

2,=.i dXi 


dF 


[ 10 ] 


[ 11 ] 


Correspondingly, the linear transformation 1 assi)ciated with a quad- 
ratic form F may, according to Eq. 9, be written in the form 

1 dF 

9 77 " >■' » = 1 . 2 , • • • «) [ 12 ] 

OX I 


2. Geometrical interpretation of a QrADRATic form; the 

QUADRIC SLTIFACE ASSOCLATED WITH A LINEAR TRANSFORMATION 

According to the derivation of the quadratic form from its associated 
linear transformation 1, it is clear that the function F may be identitied 
with the following bilinear form in the variables . 1*1 • • • .v„ and y\ • • • y,*: 

F == A'lyi 4- A:2y2 + * * * 4- .v^Vn [L?] 

If Afi • • • Xn and Vi • ♦ • r„ are regarded as the comix^nents of the vectors .v 
and y respectively, the quadratic form is represented by the scalar 
product 

F = xy [14] 

That is to say, the value of the quadratic form may be expressed as 
the scalar product of the vector x with its transform y. Of interest in 
connection with this interpretation is inquiry into the significance of the 
equation 

F=1 [IS] 

or in other words into the question: For what values of the variables 
Xi ' • • Xn does the quadratic form maintain a constant value? In Eq. 15, 
the constant value is arbitrarily set equal to unity. The question may, in 
view of Eq. 14, be stated in another way: For what lengths of the vector .v, 
which, it is assumed, may have all possible orientations, does the scalar 
product of the vector with its transform y have the fixeil value unity? 

As the vector x assumes all possible orientations, under the stipulation 
that its scalar product with the vector y maintains a unit value, its length 
is evidently forced to vary in a very definite manner. The tip of the vector 
X, therefore, describes a surface (in w-dimensional space this surface is 
n - 1 dimensional), and the solution to the inquiry stated above is seen 
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to resolve itself into the determination of the surface defined by Eq. IS, 
in which F is a function of the variables Xi ■ • • 

Since the function F is quadratic in the variables the surface 

in question is evidently one of second order, which is also known as a 
quadric surface. In three dimensions the quadric surfaces are those of the 
familiar ellipsoid, the paraboloid, or the hyperboloid. The quadric sur- 
faces in //-dimensional space may be visualized in a similar manner, 
although their reality is, of course, lost. 

Since the function F contains no linear terms, its value is unchanged if 
the algebraic signs of all the variables are reversed. This fact means that 
the vector .v satisfying Eq. 15 has the same length when its direction is 
reversed. Hence the quadric surface is symmetrical about the origin. 
A surface of this type is referred to as a central quadric surface (alterna- 
tively as a central conic). 

'File central (juadrie surface has hyperbolic or ellipsoidal characteristics, 
or both, cleix-nding upon the values and the algebraic signs of the co- 
efiicients a^k in the form F. This (fuestion is discussed further in a subse- 
quent article. 


3. Transformation’ of variables 


When the variables .Vi • • • .v„ in the quadratic form given by Eq. 3 are 
subjected to a linear transfonnation such as 


.v] = e/] 


[16] 


in which x] and .v'] are the 


column matrices 




[17] 


and C is an arbitrary square matrix of order n, then F becomes a quadratic 
form in the new variables .v'l • • • .v'„. When the matrix G is nonsingular, 
the new variables are uniquely relatcnl to the original variables Xi ■ ■ • x„, 
and hence for given variables .V| • • • .v„ the quadratic form expressed in 
terms of the new variables has the same value. In other words, the 
quadratic form in the original variables is identical with that expressed 
in terms of the new variables, the change of variable being nothing more 
than a formal change in the notation. 

At the same time, the transformation expressed by Eq. 16 may have 
the geometrical significance of a change of co-ordinate systems. In any 
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case, the immediate problem is to determine the matrix of the quadratic 
form for the new variables in terms of its matrix for the original variables 
and that of the transformation 16. 

For this purpose, it is useful to recognize that Eq. 2 for the quadratic 
form may be written in matrix form. First the transformation 1 is written 
in the matrix form 

Qx] = y] [18] 


in which Q is the matrix of the coefficients apix'aring in Eqs. 1, and y] 
is the column matrix 


>’1 


Vo 


>•] = 


[19] 



According to the form for F given by Eq. 13, it is seen that 

F = X y] [20] 


in which ^ is the transpose of the column matrix x]. Substituting for 
y] from Eq. 18 yields the desired matrix e.xpression for F, namely, 

F = ^Gx] [21] 


The expression for F in terms of the new variables x'l • • • x'n is now 
readily obtained by substituting for x] from Eq. 16, and recognizing that 
the transpose of this equation gives 

^ [ 22 ] 

Thus it is found that 

F = X G X Cx'] [23] 

which may be written 

F = [24] 

in which 

SB = X G X C [25] 

is the desired matrix of F for the new variables x\. 

Equation 25 gives the transformation formula for the matrix of a 
quadratic form when its variables are subjected to a linear transformation 
with the matrix C. This is called a congruent transformation of G. It is 
recognized that since G is symmetrical, the matrix SB is also symmetrical. 
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It may be noted that the transformation expressed by Eq. 25 is very 
similar to the collineatory transformation given by Eq. 167 of Ch. III. 
The two transformations become identical if the matrix C is an orthogonal 
one, that is, if the transformation 16 relates the co-ordinates Xx - • ■ Xn 
in a given rectangular co-ordinate system to the co-ordinates x'x - ■ ■ x' „ 
in another rectangular co-ordinate system with the same origin. This 
conclusion is recognized from the fact that for such a transformation 
C, = (orthogonality condition). 

Moreover, since the determinant of an orthogonal matrix has the 
value ± 1 (see Art. 6, Ch. II, and Art. 4, Ch. Ill) it is clear that for this 
kind of co-ordinate transformation the determinant of the matrix £B is 
equal to that of (2. In other words, the discriminant of a quadratic form is 
invariant to an orthogonal transformation of its variables. 

4. The principal axes of a qU/Vdric surface; the reduction 

OF A quadratic FORM TO A SUM OF SQUARES; DEGENERACY AND 

R.ANK 

The quadric surface defined by Eq. 15, although central, does not 
have its principal axes (like the major and minor axes of an ellipse) 
coincident with the axes of the reference co-ordinate system (the set of 
rectangular a.xes on which the components of the vector x are 

projected). If they were coincident, only the square terms in Eq. 2 
would be present. The presence of the cross-product terms means that 
the principal axes of the quadric surface have arbitrary orientations in 
space relative to the directions of the reference axes. 

It is a common problem in analytic geometry to determine the orienta- 
tions of the axes of the quadric surface defined by Eq. 15 when the 
function F is given in the form of Eq. 2. If these directions can be found 
and chosen as the axes of a new co-ordinate sys em, a change of variables, 
which amounts to a transformation to these new co-ordinates, should 
have the effect of eliminating the cross-product terms from the expression 
for F. 

Thus the geometrical problem of finding the principal axes of a quadric 
surface is seen to be essentially the same as the dgebraic problem of 
reducing an arbitrary quadratic form to a sum of squares. 

According to the vector interpretation given in Ajt. 2 of this chapter, 
the quadric surface is generated by all possible x-vectors whose scalar 
products with their transforms y have the fixed value unity. For any 
given direction of the vector x, its length is the distance from the origin 
to the surface. If the direction is along a principal axis, this distance is 
evidently a maximum or a minimum as compared with the distances for 
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any neighboring directions.* This fact forms a convenient basis for finding 
the principal axes, since it reduces the task to a maximum-minimum 
problem. 

The square of the length of the vector x is given by the expression 

;.Vj“ = .1*1“ + ^' 2 “ “{-•••+ -Vn" [26] 

Since the maxima and minima of the length of .r are coincident with those 
of the square of this length, there is no need to take the square root of the 
expression 26. In order that this length may represent the distance from 
the origin to the quadric surface, however, the variables Xi • •• .i*„ must 
satisfy the equation 

F = 1 [27] 

in which F is given by Eq. 2. 

According to the usual procedure for finding maxima, the expression 
for one of the variables obtained from Eq. 27 is to be substituted into 
Eq. 26, and the resultant function made a maximum or minimum by 
setting its partial derivatives with respect to the remaining variables 
equal to zero. Obviously, this undertaking is exceedingly awkward in 
view of the complicated form of the function F. Hence in problems of this 
sort a slightly altered procedure, known as the method of determining 
cofiditiomd maxima (or minima), is adopted. 

Thus it is said that the function 26 is to be made a maximum subject 
to the condition imposed by the auxiliary relation 27. The procedure is 
to form the function 

}{Xx, • ■ ■ Xn) = + Xn' - - 1) [28] 

in which H is an arbitrar>' constant (called a Lagrangian multiplier). 
In view of Eq. 27, this function is evidently the same as |x|^, or rather it 
should be said that the function 28 becomes the same as |xf when the 
condition 27 is fulfilled. 

The partial derivatives of/(xi, • • • Xn) equated to zero, that is 

— = 0 for ^ = 1, 2, • • • » [29] 

dXk 

yield n equations. Together with the condition 27, these may be solved 
for the multiplier H and the special values of Xi • • • which yield the 
desired conditioned maximum. 

mds drounstance is readily visualiaed in the case of an ellipsoid, for which all these 
distances arc real. For the moment, the geometrical interpretation may be confined to this 
type of surface, although the results of the present discussion, as ap()ears later on, nre not 
restricted to eUipsoidal quadric surfaces. The discussion in .\rt. 10 deals further with this 
question. 
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Forming the partial derivatives in Eq. 29 for the function expressed 
by Eq. 28 gives 


2xi - H — =0 

bXi 

2x2 - H — = 0 


[30] 


2x„ - H 


dF 


dXn 


= 0 


If the values for the partial derivatives of F given by Eq. 9 are substituted, 
these equations become 


+ • • • + (llnXn = 

0-21X1 + O22X2 + • • ■ + <l2nXn — ^2 


[31] 


+ ^n2^2 + * * * + O/in'^'n = 


in which 


[32] 


On the other hand, if Eqs. v^O are multiplied by Xj, X 2 , • • • respectively 
and added, Eq. 10 bein<^ noted, there results 

(X,- + Xj"* + • • • + X„^) - HF = 0 [33] 

and in view of the condition 27, this yields 

H = (x,^ + X 2 ^ + • • • + X„*) = |x|® [34] 

Equations 31, 32, and 34 together represen. the solution to the maxi- 
mum-minimum problem. The results are seen to be extremely interesting, 
since Eqs. 31 are recognized as the conditions for which the direction of 
the vector x is left unchanged by the linear transformation 1, as dis- 
cussed in Art. 10 and 12, Ch. III. The results of the present problem, 
therefore, yield an alternative geometrical interpretation for these 
directions, namely: The (nincipal axes of the quadric surface defined by 
Eqs. 2 and 15 together are those directions in space uhich are invariant to 
the symmetrical transformation I . 

According to the discussion in Art. 10, Ch. Ill, the direction cosines 
of the principal axes are, therefore defined by Eq. 147 of that chapter, 

t 

in which the Jfe,* are the cofactors of the elements of the determinant in 
Eq. 146 for the latent roots X,. Each latent root defines a corresponding 
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principal axis, and the modal matrix £, given by Eq. 151, yields the 
directions of the complete set of n principal axes. 

Since in the present discussions the matrix fl of the transformation 1 
is symmetrical, the modal matrix £ is orthogonal, as shown in Art. 12, 
Ch. III. In other words, the principal axes of the quadric surface forms a 
mutually orthogonal set. 

According to Eq. 52, there are n values for the Lagrangian multiplier 

H, = ^ [35] 

corresponding to the n latent roots of the matrix (i. The quantities Hi, 
H 2 , • • • H„, which arc the reciprocals of the latent roots, are called the 
proper values {cigcnzi'crte) of the matrix (2 or of the corresponding quad- 
ratic form F. Since the latent roots for a symmetrical matrix arc real (as 
shown in Art. 12, Ch. Ill), it is seen that the proper values of the quad- 
ratic form F are real also. 

An interesting geometrical signiticance of these proper values is given 
by Eq. 34, which, for a particular vector x appropriate to the root X,, 
reads 

Ha = (.1*1 “ -}- .1*2“ + * • • + A*n*) — |i‘|^ [36] 

and hence represents the square of the length of the corresponding 
principal axis extending from the origin to the quadric surface. These 
lengths are commonly referred to as the semiaxes of the quadric surface. 

This result yields a geometrical interpretation to the latent roots of a 
symmetrical matrix. They are the reciprocals of the squares of the semiaxes 
of the associated quadric surface. If the latter is ellipsoidal, the lengths of 
all the semiaxes are real, and the squares of these lengths are positive. 
The quadric surface is, therefore, ellipsoidal when all the latent roots of 
the matrix Q are positive. The appearance of negative roots indicates 
that some of the lengths of the semiaxes are imaginary and hence that the 
surface has hyperbolic as well as ellipsoidal characteristics. If all the roots 
are negative, the surface is an imaginary ellipse, but it is customary to 
include this in the classification of completely ellipsoidal surfaces. In 
any case, the above maximum-minimum problem deals only in real 
values, since the square of the length of the vector x rather than its length 
enters into the manipulations.* 

If the principal axes of the quadric surface are chosen as a new co- 
ordinate system, and if the variables x'l • • • x\ refer to this system, the 
co-ordinate transformation from the original variables Xk to the new 
variables x\ is given by the matrix equation 

x] = £/] [37] 

^The in Art. 10 is also relevant to these matters. 
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According to Eqs. 16 and 25, the matrix of the quadratic form P for the 
new variables is 

£« X Q X £ [38] 

But by Eq. 208 of Art. 12, Ch. Ill, this is the diagonal matrix 

'X, 0 0 ■ • • 0 


A = £, X (? X £ 


0 X, 0 • •• 0 


LO 0 


X„, 


[39] 


Hence the quadratic form F expressed in terms of the new variables 
x\ reads 


F = ^Ax'] 

= Xi.r i" + \.iX ' + • • • + XnX^n® 

/ > / O ft} 

^ -..L 'i- ^ ^ 

III H, ^ ^ H, 


[40] 


Equated to unity, this is the equation of the quadric surface in the 
rectangular co-ordinate system which is coincident with the principal 
axes of that surface. It is referred to as the normal form for the equation 
of the surface. At the same time, Eq. 40 represents the reduction of the 
quadratic form F to a sum of squares. 

This reduced form for F, together with the geometrical interpretation 
of the process of reduction, now yields a basis for interpreting the sig- 
nificance of the occurrence of zero roots or coincident roots to the char- 
acteristic equation of a symmetrical matrix. Thus coincident X-roots 
evidently indicate a certain degeneracy of the associated quadric surface. 
F'or example, in three dimensions a coincidence of two of the latent roots 
in the case of an ellipsoid results in an ellipsoic' of revolution, whereas for 
a coincidence of all three latent roots the elli jsoid of revolution degener- 
ates into a sphere. 

When two of the latent roots are coincident, it thus becomes clear from 
the discussion of simultaneous homogeneous equations in Art. 7, Ch. Ill, 
that the characteristic matrix (G — X^), with X equal to the value of 
the coincident root, must have the rank » — 2. 

This reduction in rank of the characteristic determinant is required 
because a repeated nonzero X-root would otherwise suggest that in- 
trinsically less than n independent axes were required to describe the 
associated quadric surface. In other words, there would be less than n 
independent directions for which the vector describing the quadric 
surface goes through extrema. It is geometrically clear, however, that in 
the case of repeated nonzero X-roots, the quadric surface still occupies 
intrinsically n dimensions in space, and therefore requires n independent 
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axes to describe it. Although the symmetry resulting from the equality 
of the lengths of two or more of the semiaxes of the surface means that the 
directions of these semiaxes are not unique, it must nevertheless be pos- 
sible to pick correspondingly two or more independent (in fact, orthog- 
onal) directions along which one may assign axes to describe the surface. 
Hence if there are p repeated nonzero X-roots in a symmetric matrix 
describing a quadric surface in n dimensions, it must somehow be possible 
to find exactly p independent, but not necessarily unique, solutions for the 
X corresponding to the repeated root. This fact requires that the char- 
acteristic matrix have rank n — p for the value of X in question. Under 
such circumstances the discussion in Art. 10, Ch. Ill, gives the procedure 
for finding such a set of n directions, and the corres{x>n(Iing nonsingular 
modal matrix £ which describes them. It is necessary to point out, 
however, that this geometrical discussion based ujx)!! quadratic forms is 
valid only for symmetric matrices, and, as jx)inted out in the article 
mentioned above, it is not possible to extrajwlate the conclusions thus 
drawm regarding coincident X-roots to the general case of nonsymmetric 
matrices.* 

When the matrix Q of the transformation 1 is singular — more specifi- 
cally, if the rank of the matrix (2 is ;/ — />— it is possible to satisfy 
Eqs. 31 in /^-independent ways for X = 0, which means that p of the 
latent roots are zero, and the corresponding p proper values H, (squares 
of the semi-axes of the quadric surface) are infinite. liquation 40 for the 
quadratic form F then has only n — p terms, and the associated quadric 
surface is again degenerate, but in a way somewhat different from its 
degeneracy in the case of repeated nonzero roots. For e.xample, a three- 
dimensional ellipsoid of rank 2 (the matrix Cf has the order 3 and the 
rank 2) is an elliptic cylinder because one of its semiaxes is infinite. It is 
still possible to find a nonsingular modal matrix £ which reduces the 
original matrix to diagonal form because of the fact that p independent 
vectors x may still be found for X = 0. The procedure is exactly that 
described in Art. 7, Ch. Ill, and corresponds to the method used for 
nonzero repeated X-roots. Moreover, the converse of the present state- 
ment is also true; namely, that the occurrence of a zero X-root of order p 
in a symmetric matrix of order n requires that the rank of the matrix be 
exactly n — p. The geometric reasoning substantiating this assertion is, 
again, that it must be possible to find n dimensions intrinsically occupied 

* In general such a nonsymmetric matrix cannot be reduced to diagonal form by a col- 
lineatory transformation, since no nonsingular modal matrix may be found.. An indication of 
what can be done by way of reduction may be found in G. Birkhoff and S. MacLanc, A Survey 
of Modern Algebra (New York: The Macmillan Co., 1941), pp. 307-308, or R. Courant and 
D. Hilbert, Metkoden der mathematischen Physik (Berlin: Julius Springer. 1931), Vol. I, Ch. 1, 
pp. 36-37. 
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by the surface, even if p of them merely indicate co-ordinates upon which 
the function describing the surface is actually not dei>endent. 

Thus it may be stated that a given quadratic form of rank r, when reduced 
to a sum of squares^ contains r terms. That is, the new variables x'l • • • x!r 
are only r in number whereas there are n original variables • • • Xn. 
This result agrees with the geometrical interpretation because the descrij>- 
tion of an elliptic cylinder, in the normal form for example, requires 
only two variables inasmuch as the function describing the cylinder is 
independent of the longitudinal dimension. 

5. A RELATED MAXIMUM-MINIMUM PROBLEM 

A problem complementary^ to that treated in the preceding article is to 
determine the maxima or minima of the quadratic form F subject to the 
condition 

^x] = X‘i“ + + • • • + Xn^ = 1 [41] 

imjx)scd upon its variables. Here the function 

/(.Vi, • • • A*„) = p — X(a:i^ + 4* • • • + Xn^ — 1) [42] 

in which the Lagrangian multiplier is denoted by X, represents the 
quadratic form provided the condition 41 is fulfilled. 

The partial derivatives of f{xi, • * • Xn) equated to zero are 

dF 

2X.VA: = 0 for ^ = 1, 2, • ■ • » [43] 

dXk 

In view of Eq. 9, these again yield Eqs. 31. Multiplying Eqs. 43 respec- 
tively by Xi, X 2 , • • • .v„, adding them, and noting Eq. 10 gives 

-F — X(xi ’ + .ro“ 4- * • • + Xn^) = 0 [44] 

Because of the condition 41, this is 

F = X [45] 

The conclusion is that the maxima or the minima of the quadratic 
form F, subject to the condition 41, occur for values of the variables 
defining the vectors .r whose directions are invariant to the linear trans- 
formation 1. Equation 45 shows that the corresponding maximum or 
minimum values of the quadratic form are equal to the latent roots of 
its matrix, that is, 

= X. (s = 1, 2, • • • «) [46] 

min 

In the present instance, the quadratic form F is not restricted to a 
constant and hence cannot be said to represent a quadric surface at all. 
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The tip of the vector v, on the other hand, does trace out a quadric sur- 
face as a result of Eq. 41 and the relation 18 between x] and 3^]. The 
extrema of | v.*, or the squares of the semiaxes of this surface, are given 
by the reciprocals of the latent roots of its matrix (2"“- (Art. 4 of this 
chapter). But, by reason of the thoughts underlying the proof of the 
Cayley-Hamilton theorem (Art. 11, Ch. Ill), these roots are the recip- 
rocals of the squares of the latent roots of Q. Hence the extrema of |y!, 
or the semiaxes of the associated quadric surface, are just the latent 
roots \s of Eq. 46. Thus the quadric surface associated with the trans- 
formation 1 may be alternatively thought of as the locus of the tip of the 
vector y, when the tip of the vector x is restricted to lie on the surface of 
an //-dimensional sphere, Eq. 41. The semiaxes of this surface may how- 
ever be calculated in both magnitude and direction from the present 
conditions yielding the extrema of F subject to the condition 41. It is 
of interest to obser\'e in this connection that the surface in question is 
always ellipsoidal, regardless of the nature of the nonsingular matrix Ct. 

6 . An interesting /Vpplic.xtion of these results 

In view’ of the results of this alternative maximum-minimum problem, 
a further particular question may be answered regarding the transforma- 
tion 1. In addition to inquiring whether it is possible for the transform of 
X to be zero, or to have the same direction as .v, one may ask : Is it possible 
for the transform of x to be a vector at right angles to Jt*? In other words, 
under what conditions is the scalar product x • y zero when neither x nor 
y is zero? 

It is shown by Eq. 14 that this scalar product is equal to the quadratic 
form F, Hence the present question amounts to inquiring whether F 
may vanish for a nonzero vector jc, that is, for 

+ ^2^ H + Xn^ = k 7 ^ 0 [47] 

In the particular application to be considered, it will suffice to find the 
answer to this question for the special case in which the quadratic form F 
is positive definite (cannot go negative). Under such conditions, the places 
where F becomes zero are clearly minima of the quadratic form, and the 
question originally asked is equivalent to asking for the conditions under 
which a stationary point of F shall occur for F = 0 * 

•In the case where P is not a positive definite form, it is not necessary for its zeros to occur 
at stationary points (subject to condition 47). The example F « xi* — X 2 * illustrates the 
point in question, when coupled with the condition xi* -f- X 2 * * iS; ^ 0. F is zero only when 

k k 

xi® =® > and the corresponding values of y] are yi* “ >2* = ;: • The stationary points 

of F are, however, at X 2 = 0, xi^ = k and xi = 0, X 2 * * k. Corresponding values of F and y] 
are y 2 * 0, yi* = k and yi - 0, - k\ while F - k and F = respectively. 
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If, then, condition 47 is substituted in place of Eq. 41, the above 
analysis remains essentially unaltered. In particular, Eq. 45 is replaced by 


F 


[48] 


The answer is that F may i)e zero iox k 9 ^ 0 only when one or more of 
the latent roots are zero. The occurrence of a zero root means that the 
matrix Cf is sinf^ular. If at the same time Eqs. 1 are to have solutions for 
a nonzero y-vector, the latter must be a member of the transposed vector 
set of Cf, which is the same as the vector set of (2 since (2 is symmetrical. 
Although the vector .v constituting a solution is orthogonal to y, it is not 
simultaneously orthogonal to all the vectors in the set of (2, since y would 
then necessarily be zero.* 

A useful application of these thoughts is to the problem of determining 
whether a given vector set is linearly dependent or independent. The 
m vectors Vi, is, • • in an «-dim('nsional space, expressed in terms of 
their components, yield the nonscjuare matrix 


^’12 * 

* ‘’iH 

„‘'ml ‘m2 * 

' * ‘mn. 


[ 49 ] 


for which it is to be assumed that m < n. 

To determine whether this vector set is linearly dependent or not re- 
quires, according to the discussion in Art. 2, Ch. Ill, that all w-square 
determinants be fonned from the matrix 49 by selecting m columns in all 
possible combinations. The linear dependence of the vector set is estab- 
lished only if all these determinants are found to be zero. 

A considerably less laborious procedure is the following. The vector 
set is linearly dependent if there exists a relation of the form 

4“ 4“ * * ‘ 4" = 0 [50] 


in which at least one of the coefficients is mA zero. The latter condition 
may be replaced by the requirement that 

4" ■^' 2 '^ 4" * • • 4" = 1 [51] 


which is neither more nor less binding than to require that at least one of 
the a*a:’s be other than zero. 

The quadratic form obtained by forming the scalar product of the 
vector expression in Eq. 50 with itself may be written 

n 

F = {XiVi + .Vot'2 + • • • + = . L (l-.- ’ Vk)XiXk ^ 0 [52] 


Fulfillment of Eq. 50 requires that the quadratic form F be zero. Hence 
•These matters are discussed in Art. 7, Ch. III. 
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if the given vector set is linearly dependent, it must be possible to find 
F = 0 subject to the condition 51. In other words, one of the latent roots 
of F must be equal to zero. According to the previous discussion this fact 
requires that the discriminant of the form 52 be zero; hence that 


Vi 


Vi < 

• 1*2 • 




V2 

■ 

V2 * 

?2 • 

. . x »2 

• Vm 

= 0 

Vm 

• '^1 

I’m 

• ?2 * 

• * 




[53] 


Thus the vanishing of only a single m-square determinant (called the 
Gramian determinant) need be investigated in order to establish the 
dependence or independence of the given vector set. 

7. Alternative: reductions 

From the discussion in Art. 4, it should be clear that the reduction of 
a quadratic form to a sum of squares is identical with the transformation 
of its square matrix to the diagonal form. The latter problem is discussed 
in Art. 10, Ch. II, where it is shown that when the matrix G is sym- 
metrical, reduction of it to a diagonal matrix 2) is accomplished by the 
congruent transformation 

X G X S = 2) [54] 

in which S is a nonsingular matrix, expressible as the product of ele- 
mentary transformation matrices. 

According to Eqs. 16 and 25, it is clear that the corresponding trans- 
formation from the variables Xj • • • jr„ to the new variables x\ • • • x\ 
is given by 

x] = 2jc'] [55] 

Hence if the diagonal matrix is written 




0 

0 • 

• o '] 

3 ) = 

0 

^22 

0 

■ • 0 


.0 

0 




then the quadratic form F, Eq. 2, expressed in terms of the new variables, 
reads 

P * diix'i^ + d22x' 2^ 4 - • • ■ + dnnx'n [ 57 ] 

More generally, if the rank of the matrix Q. (rank of the quadratic 
form) is r, the diagonal matrix 3), after suitable arrangement of its rows, 
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has the form 



and the reduced quadratic form has r terms; thus 

F = 4- + • • • + drrx\^ [59] 

The matrix G, however, possesses no unique diagonal form. According 
to the procedure given in Art. 10, Ch. II, there are any number of trans- 
formation matrices which can take the place of S in Eq. 54 and also 
effect a reduction of Q to a diagonal form. For example, another non- 
singular matrix (P may be found such that 

fPt X Ct X = 9' [60] 

in which 3)' is also a diagonal matrix. The transformation of variables 
leading to Eq. 60 may be represented by a relation similar to Eq. 55, 

x] = iPx"] [61] 

Correspondingly, there are any number of reduced forms for F, such as 
the one giv'en by Eq. 59. 

fn this connection it is important to note, however, that if the rank of 
the matrix G is r, all the diagonal matrices to which it may be reduced 
by a transformation of the form given by Eqs. 54 or 60 have the same 
number of nonzero diagonal elements (namely r), because all the trans- 
formation matrices, 2, S^, • • • etc., are nonsingular. Consequently, all 
various diagonal matrices are equivalent to G, and equivalent matrices 
have the same rank (see Art. 9, Ch. II). 

Not only do the diagonal matrices £2) and £2)' have the same number of 
nonzero elements, but they must have the same number of positive 
elements as well. This important property may be demonstrated by first 
assuming the contrary, and thereby arriving at a contradiction. 

Since the variables x'] and x"] are uniquely related to the original 
variables x] by nonsingular transformations 55 and 61, respectively, the 
quadratic forms resulting from the diagonal matrices £2) and £D' must be 
identically equal for all corresponding values of x'] and x"]. If the 
elements of £2) are now written and those of £D' are written d'*, the 
identity mentioned above becomes 

dx{x\)^ + • • • + + d,+dx'^x)^ + ■■■+ drix'rY ^ 

+ . . . + d\{x",Y + + • • • + d'r{x"rY [62] 
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Let the notation be chosen so that the first /x terms in iD and the first v 
terms in £2)' are the positive elements in these matrices. The assumption 
that one of these numbers is greater may be stated by requiring either 
fx > V or p > fjL, It is again merely a matter of notation to assume p < 
The follo\\nng spx?cial choice of variables in Eq. 62 is now 

made: 


— • • • — — 0 
.r"i = .t" 2 = • • = x", = 0 


[63] 


In terms of the original variables .v] in Eqs. 55 and 61, the conditions 63 
yield p + {n — fi) < n equations in n unknowns. It is then surely possible 
to find a solution for .v] in which not all the variables are zero (Art. 7, 
Ch. III). Moreover, the fact that fP and S are nonsingular means that 
values of • • • .r"„ and .v'l • • • exist such that neither all the .r", 
nor all the .r',- are zero. These special values of .v'] and .r"] are now sub- 
stituted into Eq. 62, and the fact that rfj, • • ■ , > 0 while • • • , 

< 0 is emphasized by using absolute values of the latter. The identity 
62 then becomes the equation 

+ • • • + d^(x'^y = — — . . . — |(/',.| (x"r)“ [64] 

The result 64 is clearly impossible, since the left side is definitely greater 
than zero and the right side cannot jiossibly be greater than zero. It is 
therefore necessary to conclude that p — Mi since the assumption of an 
inequality leads to the contradiction in Eq. 64. 

All the various reduced forms for F, like the one given by Eq. 59, 
therefore, have two things in common regardless of how this reduction is 
accomplished. The total number of terms is always equal to the rank r, 
and the numbers of positive and of negative coeflicients are always alike. 
This result is known as the lazc^ of inertia of quadratic forms. 

If the number of positive coefficients in a reduccxl form is denoted by 
P, and the number of negative ones by .V, evidently 

r^P + N [65] 


The difference 

s = P - N [66] 

is called the signature of a quadratic form. Both the rank and the signa- 
ture are thus seen to be invariant to the congruent transformation given 
by Eq. 25, in which C is any real nonsingular matrix. 

Although there are any number of matrices like S, £P, • **• , etc., which 
reduce the matrix (i to a diagonal form and F to a sum of squares (ac- 
cording to the congruent transformation 54 or 60), there is essentially 
only one orthogonal matrix: which effects such a reduction, namely the 
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modal matrix £ defined in Art. 10, Ch. Ill * Only for this orthogonal 
matrix do the coefficients of ¥ in its reduced form equal the latent roots 
of 0. In other words, if the elements d 22 y * * * dnn in the reduced form 
given by Eq. 57 are equal to the latent roots Xi, X 2 , • • • Xn respectively, 
the matrix S in Eq. 54 is the orthogonal modal matrix £. 

The congruent transformation, which applies to the transformation of 
the variables in a quadratic form (as discussed in Art. 3), should not be 
confused with the collineatory transformation discussed in Art. 10, 
Ch. III. Since the matrix 0 of a quadratic form is symmetrical, its modal 
matrix £ is orthogonal (as shown in Art. 12, Ch. III). The collineatory 
transformation £""*Cf£, which carries (i over into the diagonal form of 
its latent roots, is then identical with the congruent transformation 
£,Cf£, because the trans[X)se and the inverse of an orthogonal matrix 
are the same. 

However, if the voliimns of £ are multiplied by arbitrary nonzero 
factors, the collineatory transformation still accomplishes the same 
result (as {X)inted out in Art. 10, Ch. Ill), but the matrix obtained by 
this modification of£ is no longer orthogonal, ’‘and a congruent transforma- 
tion of Cf by means of it does not reduce 0 to a diagonal form and hence 
does not reduce the corres|K)nding quadratic form to a sum of squares. 

Thus when a collineatory transformation reduces the s>mmetrical 
matrix 0 to the diagonal form of its latent roots, the matrix effecting 
this reduction is not necessarily an orthogonal one (as in the case of a 
congruent transformation), but may be an orthogonal matrix post- 
multiplied by an arbitrary nonsingular diagonal matrix. 

8. Definite quadratic forms 

' If all the coefficients dkk in the reduced form for F are positive or all 
negative, it is evident that the quadratic fo'm is positive or negative 
respectively for all possible nonzero values (> the variables. This result 
is true whether the variables are the x'k appearing in the reduced form 
or the Xk related to the variables .v\- by a nonsingular transformation 
like Eq. 55, because this transformation amounts merely to a change in 
notation. Such quadratic forms are referred to as being either positive or 
negative definite. 

If a quadratic form is positive definite, all the latent roots of its matrix 
are positive. Conversely, if all the latent roots are positive, the quadratic 
form must be positive definite. Incidentally, it is worth noting in this 
connection that the quadric surface of a positive definite form must be 
ellipsoidal. 

•The only freedom in the construction of the matrix £ occurs when there are repeated 
latent roots, and the directions of some of the principal axes of the corresponding quadratic 
form are not uniciue. 
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When the quadratic form is positive definite, so that all the coefficients 
dkk in Eq. 59 are positive, the further congruent (real) transformation 

x'k = (for k = 1, 2, • • • r) [67] 

dkk 

evidently reduces F to the so-called canonic form 

F = + .v",*-’ -f . . . + .r",2 [68] 

in which all the coefficients are -f 1. The requirement that F be positive 
detinite is necessary in order that the transformation given by Eq. 67 be 
real. Hence the statement: -1 positive definite quadratic form {singular or 
nonsingular) may alu\iys be reduced to the canonic form, given by Eq, 68, 
by means of a real nonsingular congruent transformation. 

9. A CRITERION FOR POSITI\T DEFINITENESS 

If the quadratic form given by Eq. 2 is ix)sitive definite, it is readily 
seen that regardless of its rank, none of the diagonal elements of its 
matrix can be either negative or zero. For, if Okk were such a zero or 
negative element, then setting all the variables excej)! Xk and x, equal to 
zero, and letting x, = 1, would give the form the value 

F = Oti + lOxkXk + (ikkXk“ [ 69 ] 

which, for Okk ^ 0, can certainly become negative by a suitable choice 
for the value of Xk- Although the condition 

akk > 0 [70] 

is a necessary one for positive definiteness, it is not sufficient. 

A useful set of necessary and sufficient conditions may be found 
through the following method of representing a positive definite quadratic 
form as a sum of squares. If the quadratic form is assumed to be non- 
singular and positive definite, the canonic form reads 

f = /i^ + xV + ---+^V [71] 

and the variables x\ — • x\ must be related to the variables Xi ^-Xn 
by a linear transformation such as 

Pn^l + Pl2^2 + * * • + pln^n = 

p2\^l + P 22 X 2 + • * • + p2nXn = X 2 1 *^ 2 ^ 

Pnl^l “b Pn2^2 T" * ’ ' Pnn^n ^ ^ n 

The matrix fP of this transformation is restricted only by the condition 

that it be nonsingular and real; that is, the coefficients pik must be real 
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numbers. In matrix form, Eqs. 72 read 


II 

[73] 

and hence 


^ X 9x] 

[74] 

Identifying this result with Eq. 21 for F. it follows that the matrix Q is 

given by 


a ^ 9, X 9 

[75] 


If P\ - ' ■ Pn represent the transposed vector set of Eq. 75 

shows that the elements of the matrix Q are given by the scalar products 


= pi' • Pk [76] 

It thus becomes clear that the elements of the matrix of a nonsingular 
positive definite quadratic form are necessarily determined from a 
linearly independent set of vectors in a manner similar to that which 
yields the elements of the Oramian determinant given by Eq. S3. The 
sufficiency of this conclusion is appreciated from the fact that if, for a 
given matrix Cf, a nonsingular matrix satisfying Eq. 75 can be found, 
the transformation of variables given by Eq. 73 reduces the equations 
having the matrix Ct (Eqs. 1) to a set of identities. That is, 
(the unit matrix of like order), and all the diagonal 
elements of are + 1 , The existence of such a matrix is thus recognized 
to be the necessary and sufficient condition for the positive definiteness 
of the quadratic form associated with the nonsingular matrix (?. 

Incidentally, it may be noted that the elements of the matrix 9 of the 
fundamental metric tensor (see Art. 5, Ch. Ill), are formed in a similar 
manner. The latter is, therefore, the matrix of a positive definite quadratic 
form. 

In particular, it may be observed that the elements on the principal 
diagonal of Q are given by 

a** = •/>*' = b.? [77] 

These are equal to the squares of the absolute lengths of the vectors. 
According to the remarks made in the opening paragraph of this article, 
none of the vectors Pt‘ is, therefore, allowed to be zero. This, as well as 
the condition that the vector set pt' be linearly independent, is assured 
by the requirement that iP be nonsingular. 

It should now be observed that the assumption of a more special form 
for the matrix does not subject the above argument to any restrictions. 
Thus the vector pi' in the transposed vector set of 9 may be assumed to 
be coincident with axis 1 of the rectangular reference system. The second 
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vector p 2 is chosen to lie in the plane (leterniined by the axes I and 2; the 
third vector />./ is then oriented so as to he confined within the three- 
dimensional subspace determined by the axes 1, 2, and d, and so on. 
This procedure in no way restricts the generality of the vector set. 
Pi* ' ‘ ' Pn\ but merely amounts to a particular orientation of the reference 
axes relative to the given vector set pk. 

The result of this choice of orientation for the reference axes is that 
/>/ has no comjx^inents other than that on the axis 1; p 2 has no com- 
ponents other than those on the axes 1 and 2, and so forth. The matrix 
then assumes the s{K‘cial triangular fonn 


f/’n 

PV2 

/>,:< • • • 

pin 

0 

P2> 

/>■.. • ■ • 

pin 

0 

0 

/>x. ■ • ■ 

P'.i M 

_0 

0 

. . . 0 

Pnn^ 


The real nonsingular matrix fP having this form, the [)roblem now is to 
discover the conditions which are im{x)sed uiK)n the given matrix 01 by 
the requirement that it shall have the representation expressed by Eq. 75 
(according to which its elements are given by K(|. 76). The most direct 
way of discovering these conditions is to proceed with a detennination of 
the matrix 9 for a given matrix Cf. A ix)ssible procedure for this determina- 
tion is suggested by the method of reducing an arbitrary symmetrical 
matrix to a diagonal form by means of the congruent transformation 54 
(as discussed in Art. 10, Ch. II, and there expressed !)> Eq. 189). Ac- 
cording to this method, the matrix ^2 effecting the reduction may have 
the same triangular form as in Eq. 78 if (as is true in the present 
problem) none of the diagonal elements in the given matrix Cf are zero. 
The matrix S, moreover, has diagonal elements which are all -f 1. 

With reference to Eqs. 54 and 75 one may write 

Q = = fP,fP [79] 

Since the inverse of the matrix f2 again has the same triangular form with 
diagonal elements which are all + 1 (as may readily be seen from any 
method of matrix inversion, for example, the one discussed in Art. 6, 
Ch. II), the formation of the matrix fP is made evident by Eq. 79. Thus, 
denoting by the diagonal matrix whose diagonal elements are the 
square roots of the respective elements of £/), one has 

g> ^ 2 ) 1 / 22-1 fgO] 

The significant part about this result is that the diagonal elements of fP 
are those of 3)*^^. The diagonal elements of 3), therefore, are the squares 
of the diagonal elements pu, P 22 , * * • pnn of 9, The positiveness of these 



Arf. 9] 


A CRITERION FOR POSITIVE DEFINITENESS 


153 


squared elements is thus seen to be the necessary and sufficient condition 
for the positiveness of the diagonal elements in and hence for the 
positive definiteness of the quadratic form having the matrix (2. It 
remains to express this requirement in the form of conditions upon the 
elements of the matrix (2. 

The determinant P of the triangular matrix in Eq. 78 is equal to the 
product {p\\p 22 • • • Pnn) of its diagonal elements, none of which, because 
fP is nonsingular, is allowed to be zero. According to Eq. 75, the de- 
terminant A of the matrix (i is 

A = P" — {pnp22 * * • PnnY [81] 

Now if the variable x,, is set equal to zero, x' n (according to the trans- 
formation 73 and the special form of 9 given by Eq. 78) becomes zero 
also. The quadratic form F then appears as a function of the variables 
.V| • • • .v„_i only, or in the reduced form as a function of the variables 
.v'l ■ • • only. The elements in the last row and column of the matrix 
Cf or of the matrix fP then have no further influence up)on the values of 
F in Ecj. 2, so that the latter may be regarded as a quadratic form in 
n — 1 variables with a matrix obtained through striking out the 
last row and column in Cf. ('orresp)ondingly, the last row and column in 

may be struck out, and the remainder denoted by If Pn-i is the 
determinant of and that of (2n_i, the same reasoning as before 
shows that 

An-\ = P n-\“ = {p\\p22 ' ' ’ Pn-^\Y [82] 

In a like manner, through also setting Xn- 2 , etc., equal to zero, 
it may be seen that the determinants An-z, etc., which are ob- 

tained by striking out the last two, three, etc., rows and columns in A, 
are given by 

A — P n—2“ “ ^P\\p22 * * ' ^n— 2 )^ 

= Pri-3” = {Pllp22 • ' Pns)^ To^l 


A I = dll = PlC 

From these equations it follows that 

pii“ = Ai = ail 
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The necessary and sulVicienl conditions that a nonsin^ular quadratic 
form F with the discriminant A be jx^sitive definite are, therefore, stated 
by the inequalities 

.1 > 0, A „_i > 0, .1 > 0, • • • vl 1 = tin >0 [85] 


.4 nonsingiilar quadratic form is positive definite if the discriminant and all 
its principal minors have positive nonzero values. 

The relations 84 together with Kq. 76 afford a method for determining 
the triangular matrix of K(|. 78 whereby the (juadratic form may be 
reduced to the canonic form giv(‘n by l-'.q. 71. 4'hus the diagonal elements 
in are given by Eqs. 84 directly. Xext, Kcj. 76 shows that 

"n = Pilpik f<’r k = 2, 3, • • • n [86] 

from which the remaining elements in the first row of are determined. 

Using Eq. 76 again, one finds 

njh — Pl2plk “e p22p2k ^ ~ 4-, • * • u [87] 

Here p 2 k is the only unknown, and so the remaining elements of the second 
row of fP can be calculated. From 

^3k — P\3p\k p23p2k 4" Pn.ipiih for = 4, 5, • • • w [88] 


in which p:a- is the only unknown, the remaining elements in the third 
row of .9 are found, and so forth. 

If the quadratic form F is singular more si)ecifically, if its matrix Cf 
has the rank n — p then the criteria for [)ositive definiteness are the 
same as those given by the relations 85 except that the equals sign is 
included with the first p inequalities. 

Thus if (2 is the matrix of a ix)sitive definite quadratic form of rank r, 
it may be reduced to its canonic form by means of a real nonsingular 
triangular matrix 2 in the congruent transformation 


10 0 0 . • 0 
0 10 0 . • • 0 




& = 


0 0 • • 1 

0 0 


0 • • • 0 
0 


0 0 • • ■ 0 


[89] 


Here the canonic matrix 6 has r units (-f 1) on its principal diagonal. 
The matrix 2 has the same triangular form as 9 in Eq. 78. 

The inverse of 2 again has the same triangular form, and hence may be 
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identified with 9’ of F^q. 78, so that the singular matrix (2 is represented by 

tt = [90] 

in place of Lq. 75» which holds only when (2 is nonsingular. 

IFearing in mind the form of the canonic matrix S as shown in Eq. 89, 
and applying the same reasoning as in the previous argument in which 
Cf is assumed to have the rank n, one may establish the above statement 
regarding the modification of the criteria 85 for any rank n - p. 

10. The iter.ated (^uadratk form 

The*quadratic form whidi results when the matrix G of E is replaced 
by the A’th {xiwer of C( is referred to as the iterated form of i^th order, or 
as the A’th iterated form of F. If £ is the orthogonal modal matrix of G, 
as pointed out in Art. 1 1, ('h. Ill, 

£,G“£ = £,G££,G£ = A- [91] 

in which A is the diagonal matrix of the latent roots of G. 

Since 



the latent roots of the second iterated form of F are equal to the squares 
of the latent roots of F. More generally, the latent roots of the Hh 
iterated form of F are recognized as being equal to the A’th powers of the 
latent roots of F, 

It is also clear that the modal matrix £ which transforms F to its 
normal form (sum of squares) also transforms any of the iterated forms 
of F to their normal forms, and that the latter differ from the normal 
form of F only in that the coefticienls of the sriuare terms appear raised to 
the A’th {X)wer. 

It appears that all the iterated fomis of even order are positive definite^ 
and that their asscuiated quadric surfaces are entirely ellipsoidal in 
character. Moreover, the principal axes of these ellipsoidal quadrics 
coincide in direction with those of the surface associated with F, since 
the modal matrix £ delines these directions. 

Hence in searching for the principal axes of a quadric surface F = 1, 
it is immaterial whether the given form F or any of its iterated forms is 
considered. With regard to the method discussed in Art. 4, it is thus 
recognized that the argument is not subjected to any loss in generality by 
the assumption, for the purpose of clarifying the geometrical visualization, 
that the given quadric surface is entirely ellipsoidal in character. 
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U. The SlMl LTANEOrS rediktion of a pair of quadratic forms 

TO SI MS OF SQl'ARES 

Two quadratic forms are considered to be given by the matrix equations 


Fi = .r (?.v] 

[93] 

F., = ^H\x] 

[94] 


In the present discussion, it is assumed that at least one of these forms is 
positive detinite and nonsingular.* 

The fundamental principle u[X)n which the simultaneous reduction is 
based is the following. If one of the forms, for e.xample Fi. is nonsingular 
and positive detinite, the ass(^ciated cjuadric surface may be visualizefl 
as that of an ellipsoid. The variables Xi • • • .v„ of the two functions F'l 
and Fo are first subjected to the orthogonal transformation which intro- 
duces the princij)al axes of this ellipsoid as a new co-ordinate system. 
This transformation reduces Fy to a sum of s(|uares but does not in 
general eliminate the cross-product terms from Fi also. 

A further real transformation of the form given by Kq. 67 next carries 
F 2 over into its canonic form, and the a.s.sociated ellipsoid becomes a 
sphere. If Fy were not nonsingular, the resulting quadric surface would 
not be spherical; and if F^ were not [K)sitive definite, the transfonnation 
which reduces it to its canonic form would not be real. The resulting form 
Fi at this stage would then not be real either, and the following step 
could not be carried out by means of a real transformation. 

The equation of a sphere with its center at the origin is evidently in 
the normal form (sum of squares with coelhcients unity) regardless of 
the angular orientation of the rectangular co-ordinate axes. Any set of 
orthogonal axes can be principal axes for the si)here. Hence the variables 
in both quadratic forms can now be subjected to any orthogonal trans- 
formation without further affecting the form of F*. At this step an 
orthogonal transformation can be found which reduces Fi to a sum of 
squares (transforms to the co-ordinates which are the principal axes of 
the quadric surface defined by the form which Fi has at this stage), and 
F 2 remains in its canonic form. 

A transformation which combines these three steps evidently accom- 
plishes the desired simultaneous reduction of both Fi and F 2 . This 

•In physical problems, quadratic forms usually represent energ>' functions, and hence this 
assumption does not constitute a serious restriction from the practical standix)int. The alge- 
braically more complicated problem which results when these restrictions are not imposed, 
is not treated here (for an intr^xluction to this more general problem the readier is referred to 
M. Bocher, Introduction to Higher Algebra (New York: The Macmillan Co., 1927)]. 
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reduction is now formulated explicitly by means of the following matrix 
equations. 

If F 2 is positive definite and nonsingular, the discussion in Art. 9 shows 
that its matrix may be represented by the relation 

® = fp,g> [95] 

in which f? is a nonsingular real matrix. This matrix may be considered 
to be any one which satisfies the congruent transformation 

[96] 

^1 being the unit matrix ha\ing the .same order as .‘B. 

One method of finding is to determine first the orthogonal modal 
matrix £"” of which, by means of the c<>i\gruent transformation 

f/'-ffir'* = [97] 

reduces fB to the diagonal fonn of its latent roots. This diagonal matrix 
is denoted by At,. Xow if - represents a diagonal matrix with diagonal 
elements ecjual to the S(iuare roots of the latent roots of JB (none of which 
is zero), 

.\r' -£,""JB£'^'Ar‘'- = % [98] 

A comparison of Kqs. 96 and 98 yields 

= .tfc' -£/« [99] 

However, since 

= qi [100] 

in which 0 is an arbitrary orthogonal matrix of the same order as % and 
fB, it is readily seen that 9"’ may more generally be represented as 

fB = OAfc' -£/'’> [101] 

The matrix fP may be found by this method, or perhaps more simply 
by means of the process discussed in Art. 10, Ch. II. Ihis part of the 
simultaneous reduction of ft and F-, is not unique and may be accom- 
plished by whatever procedure appears to be the most expedient under 
the given circumstances. 

If the variables x, ■ ■ ■ x„ in the quadratic forms given by Eqs. 93 and 
94 are subjected to the transformation expressed by the matrix equation 

x] = ff-'x'] [102] 

the matrices of these quadratic forms are subjected to the congruent 
transformations 




[ 103 ] 
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and 

JPr'39^^ = ^ [104] 

Fo is thus reduced to its canonic form. Fi at this stage has the matrix 
which exidently is still real and symmetrical. Hence it possesses an 
orthogonal modal matrix which carries it over into the diagonal 

form of its latent roots; thus 

M ^ 

These roots are called the latent roots of Q unth respect to fB. Since is 
symmetrical, these latent roots are real. I'hey are positixe if (i (as well 
as is the matrix of a positive definite quadratic form (see Art. 7 of 
this chapter).* 

The further transformation of variables 

A-'] [106] 

with the orthogonal matrix f leaves F 2 in the canonic form 

F 2 = -r (-v'^2)‘' 4- . . . -f- [107] 

and reduces Fi to the sum of squares 

Fi = H- + • • • -f [108] 

in which X/® • • • X„*'' are the latent roots of Ci with respc*ct to fB. 

These are the roots of the characteristic ecjuation 

X^l; = - xqil = 0 [109] 

Since the latent roots of a matrix are invariant to a collincatory trans- 
formation (see Art. 10, Ch. Ill), and 

= Q9-^9r^ = artr* [no] 

or 

cp-iQ(6)cp = = Cjs-^a [111] 

it follows that the characteristic equation 109 may alternatively be 
written in either of the forms 

jGfjfi-* - X^Uj = 0 [112] 

or 

iffl-'Q-xeUl=0 [113] 

Since ffi is assumed to be nonsingular, the equation 

|S(£B-*a - X<^)1 = 0 [114] 

•This fact b an important one in the study of oscillating systems in d>mamics and electrical 
network theory. 
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lias the same roots as Eq. 113. Hence the latent roots of (2 with respect 
to fil may also be defined as the roots of 

[d - XiBj = 0 [115] 

In other words, the characteristic equation for these latent roots may be 
written in any one of the four forms given by Eqs. 109, 112, 113, or 115. 

Since there are no restrictions on the symmetrical matrix (? (except 
that it i)e real, of course), some of the roots of the characteristic equation 
115 may be negative, or coincident, or zero. Observe that S and 
contained in K(|. 103 are equivalent matrices (because is nonsingular) 
and hence they have the same rank. Therefore, if the rank of the matrix 
a of Fi is r, the characteristic equation 109 or 115 has r nonzero roots, 
and the reduceci form for Fi given by Eq. 108 has r terms. The canonic 
form for /%, however, must have n terms because the matrix of F 2 is 
assumed to be nonsiiigular. 

12. An alternatu^ geometrical interpretation of the same 

PROliLEM 

So far in the discussions of this chapter, the various geometrical inter- 
pretations have assumed a rectangular co-ordinate system. Thus the 
equation E = 1 for the quadric surface associated ^\^th a given quadratic 
fonn is visualized geometrically by supposing the variables aq • • • Xn to 
l)c the co-ordinates of a jx)int with respect to a rectangular system of 
axes. The normal form for this equation, for which F appears as a sum 
of sfjuares with arbitrary real coellicicnts, is then familiarly recognized 
as yielding the quadric surface (ellipsoid, for example) with its orthogonal 
set of principal axes coincident with the co-ordinate axes. 

Although the tacit assumption of a rectangular co-ordinate system is 
quite in order, it is just as feasible to supp se that the reference co- 
ordinate system, which is used for the geoihetrical interpretations, is 
given by an oblique set of axes, but these interpretations must then be 
revised. 

First, in this regard, it is significant to observ^e that an ellipse, for 
example, remains an ellipse when the angle between the co-ordinate axes 
is allowed to depart from 90 degrees. The equation of the ellipse is 
supposed to remain fixed, and the co-ordinates and X 2 are then assumed 
to be the parallel projections of a point P upon the oblique axes; that is, 
they arc the contravariant components of the vector OP (O denotes the 
origin) which is the vector sum of its comp)onents according to the familiar 
parallelogram law of addition. If the given equation is plotted by laying 
off corresponding values of Xi and X 2 along such a set of axes, the geo- 
metrical form of the resulting figure is still elliptic provided it is eUiptic 
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when identically the same etjuation is plotted by laying? ofT .V| and Xo 
along a set of rectangular axes. 

Next, it should be noted that even if the ecjuation of the central ellipse 
is in its nonnal form, the principal axes of the ellipse need not coincide 
with any of the oblique co-ordinate axes. 'I'his fact the reader may 
readily establish for himself by plotting a simple numerical example on 
oblique axes. In this connection it should be observed that the equation 
of a circle in rectangular co-ordinates represents an ellipse w'hen plotted 
in oblique co-ordinates. 

According to the previous discussions of this chapter, the equation of 
a giv'en central ellipse is transfonned to its normal form by rotation of 
the rectangular co-ordinate system until it coincides with the principal 
axes of that ellipse. In view of the remarks of the present article, it 
appears that this reduction may alternatively be thought of as accom- 
plished through transforming to an oblique set of co-ortlinatc axes whose 
angular orientations relativ e to the given ellipse arc such as to yield this 
same ellipse by means of an equation which contains sejuare terms only. 

It is significant to note that there are an infinite number of pairs of 
oblique axes for which the equation of an ellip.se goes into the normal 
form. The distinguishing characteristic of a normal form is the property 
that a reversal of the sign of any particular co-ordinate of a point on the 
curv'e ahvays yields another point on the curve. If any line through the 
center of the figure is chosen as one of the oi)li(jue axes, the other axis 
required to yield an equation for the ellipse in normal form is uniquely 
determined. The symmetry requirement mentioned above may be met 
if the second axis is chosen to be the one bis<?cting all chords of the figure 
parallel to the first axis. Such a pair of axes is known as a pair of conjugate 
diameters, and it is a property of these lines that each bisects all chords 
parallel to the other. It is this property which meets the symmetry con- 
dition characterizing a normal form in an oblique set of co-ordinates. A 
simple way to find the second axis, when any first axis is arbitrarily 
assigned, follow’s from the realization that the tangent to the ellipse at 
either end of the first axis is a limiting form for the set of parallel chords 
bisected by that axis. Hence the second axis is assigned to be that line 
through the center of the ellipse which is parallel to a tangent to the 
curve at the extremity of the first diameter chosen. 

The clue to the simultaneous reduction of two quadratic forms to 
sums of squares also lies in these ideas. Thus, if two ellipses are given (the 
two forms F\ and F2 may for the moment be thought of as defining a pair 
of central two-dimensional ellipses) with arbitrary semiaxes and with an 
arbitrary relative orientation, it is possible to find a single pair of oblique 
axes such that both of the given ellipses may be described by means of 
equations which contain square terms only. 
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A sketch of any two su( h central ellipses will convince the reader that 
it is always ix)ssil)le to choose a first axis in such a way that tangents to 
both ellipses, at the points where this line intersects them, are parallel. 
The second axis is then taken parallel to these tangents, and in this set 
of oblique co-ordinates both ellipses have the symmetry property which 
indicates a normal form for the equations representing them. 

The feasibility of obtaining such a set of oblique axes in the general 
case is thus indicated, though the most expedient method of handling the 
problem analytically does not directly parallel the above geometrical 
discussion. 

With reference to the Eqs. 93 and 94 for the forms Fi and F 2 , these 
ideas are formulated more specifically by the statement that there exists 
an oblique co-ordinate system defined by the unit vector set Ci, f 2 , • • • Cn 
of the matrix t*, such that a transformation to the contravariant variables 
of this system (see Kq. 43, Art. 5, Ch. Ill) by means of the equation 

x] [116] 

and its transjKise 

jr = fC [117] 

simultaneously carries the expressions for these two quadratic forms 
over into 

= ^2), I] [118] 

and 

F 2 = = ^2)2?] [119] 

in which 2)i and 2*2 are diagonal matrices. 

The orientations of these oblique axes, define d by the vector set of the 
matrix (?, are found in the solution to the follow ing closely related problem. 
The linear transformations associated with the quadratic forms Fi and 
Fo may be written 

Qx] = y] [120] 

and 

SB*] = 2 ] [121] 

The first of these equations transforms a vector * into a vector y, and 
the second transforms the same vector x into a vector 2 . The question 
may be raised whether there are any particular directions for the vector * 
(relative to a fundamental rectangular co-ordinate system) for which its 
two transforms y and z are coincident in direction (but not necessarily 
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coincident with .r). This condition is expressed by the matrix equation 

y] = \z] [122] 

in which X is a constant multiplier. 

Substituting Eqs. 120 and 121 into Eq. 122 yields 

- Xffi).v] = 0 [123] 

The question, therefore, amounts to inquiring whether nontrivial solu- 
tions may be found to the set of homogeneous equations 

(an — X6n)-^‘i {*-^12 “ \bi2)x2 r • * * T (ai„ — = 0 

(^21 “b (<^22 ~ XZ>22)-'*2 "b * * • “h {(Jjn ~ X^2n )•'*« ~ 0 |^J24] 

(dnl — Xi„i).Vi (a„2 — X6 ,i2)-V2 + * * * + — X/^„„)a*„ = 0 

The condition for the existence of such solutions is that the determinant 
of this system of equations shall vanish, that is, 

Cf ~ XftV = 0 [125] 

This is identical with the characteristic equation 115 which determines 
the latent roots of G with respect to fB. If the determinant of Eejs. 124 
for a particular root X,^"* is written 

’a - [126] 

t 

and the cofactors of the fth row of this determinant are denoted by /.’a, 
the direction cosines of a vector x constituting a solution appropriate to 
the root X = X*^** are given by 

M 

tt. = fOF ife = 1 , 2, • • • « [127] 


provided all the X-roots of Eq. 125 are distinct. 

For such a vector x with the components Xj, a' 2 , • • • relative to the 
fundamental rectangular co-ordinate system, Eq. 123 may be written 

Qx] = [128] 


The n equations of this form, for the n particular vectors i, .t, • • • x 
appropriate to the n roots of Eq. 125, may be combined into one matrix 
equation by definition of the modal matrix 


'n 

fl2 ' ■ 

• fin 

21 

^22'* 

■ hn 

^nl 

fn2 * 

■ ' f nn 


[129] 


the elements of which are the direction cosines defined by Eq. 127. Then 
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the complete set of equations like the one given by Eq. 128 is contained in 

as = ffi£Aa/6 [130] 

in which Aa/b is the diagonal matrix of the latent roots of Q with respect 
to fB, that is, the same diagonal matrix as that appearing in Eq. 105. 

If Eq. 130 is premultiplied on both sides by the transpose of £, it reads 

= (£3£)Aa/6 [131] 

The resultant matrices £t(3£ and £/f5£ are necessarily s^onmetrical 

because Cf and fB are symmetrical. Equation 131 implies that after the 
columns of the symmetrical matrix £/£B£ are multiplied by a set of factors 
respectively, the resulting matrLx is still symmetrical. 
This condition can be jxDssible only if £,fl£ and £fSB£ are both diagonal 
matrices. Hence if these are written 

£,(?£ = 9i [132] 

and 

£3£ = % [133] 

then a comparison with Eqs. 118 and 119 shows that the matrix C has 

been found, namely, 

e = £, [134] 

The direction cosines of the set of oblique co-ordinate axes, in terms of 
which the equations of the two quadric surfaces associated with the 
forms El and Eo appear as sums of squares, are given by Eq. 127. More 
specifically, the direction cosines of the 5th oblique axis are the elements 
in the 5 th column of the matrix £ defined by Eqs. 127 and 129. The set 
of unit vectors characterizing the oblique axes with reference to the 
fundamental rectangular co-ordinate system .s the transposed vector 
set of £. 

Equations 131, 132, and 133 show that 

Aa/6 = 9)2 ~'Si [135] 

or, if the diagonal elements in 3)i and % are denoted by and 
respectively, that 

[ 136 ] 

The quadratic forms f i and F 2 given by Eqs. 93 and 94 are, therefore, 
simultaneously reduced to 

+ • • • + 


[ 137 ] 
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and 

p'l = 4 - • • * + [ 1 ^ 8 ] 

when the variables .Vi • • • a*„ are subjected to the linear transformation 

:r] = £{] [139] 

The matrix £ is, of course, not orthogonal, as is clear from the fact that 
it detines a set of oblique axes or as is alternatively made evident from 
Eq. 130, which can be written 

= A.,./, [140] 

It thus appears that £ is the modal matrix whicli reduces the dissym- 
metrical matrix to the diagonal form of its latent roots. 

As shown in the previous article, these latent roots are real, but if Cf 
has the rank r, then u — r ol them are zero. Since is nonsingular, none 
of the diagonal elements is zero, but E(|. 136 shows that as many 
elements of the diagonal matrix are zero as there are zero X-roots in 
the characteristic equation 125. Thus the reduced form 137 has r terms, 
and that given by Eq. 138 has n terms. 

Since F 2 is positive definite, all the coefticients in Eq. 138 are positive. 
A further real transformation of the variables then carries Fo and Fy over 
into the forms given by Eqs. 107 and 108 respectively. 


13. A FEW REMARKS REGARDING THE SIMULTANEOUS REDUCTION OF 
MORE THAN TWO QUADRATIC FORMS TO SUMS OF SQUARES 

In the solution of many practical problems, it would be verx' desirable 
to be able to reduce more than two quadratic forms to sums of squares 
simultaneously. To do so is in general not possible, however, as the reader 
can readily appreciate by considering an attempt to extend the reasoning 
in the opening paragraphs of Art. 11 or 12 to more than two forms. 

If p quadratic forms Fi, F2, ’ ’ Fp are given, it is, however, possible 
to reduce them simultaneously in the special case for which there exist 
p — 2 independent relations of linear dependence of the form 

7iFi + 72^2 + • * * + ypFp = 0 [141] 

in which at least one of the coefficients has a nonzero v'alue. The condi- 
tion 141 must, of course, hold for all values of the common variables 

• • • Xn. 

Cases of this sort which do occur in practice are the problems of reduc- 
ing the three forms Fi, F 2 , and F 3 , when 

Fi = 7F2 


[142] 
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or of reducing the four forms, f’,, Fo, F,, F4, when 

Pi = 1P2 [143] 


Evidently, a transformation which carries Fi and F^ over into their 
reduced forms simultaneously yields a similar reduction for F^ and F^. 

14. The abridgment of a quadratic form that results from 

IMPOSING linear CONSTRAINTS UPON ITS VARIABLES 

Consider for the moment a quadratic form F in only three dimensions, 
and visualize its associated quadric surface defined by F = 1 as a central 
ellipsoid. If one demands that one of the three variables Xi, X2, X2 be zero 
(for example, if one arbitrarily sets = 0), it is clear geometrically that 
the ellipsoid thereby is reduced to the two-dimensional ellipse given by 
the intersection of the ellipsoid with the co-ordinate plane normal to 
axis 3, i.e., the 1-2 plane. This ellipse is defined by an equation F = 

I in which F is obtained from F through simply dropping those terms 
involving .1*3. 

The equation X3 = 0 may be regarded as a linear constraint which 
restricts the original variables to v^alues corresponding to points on the 
intersection of the ellipse with the 1-2 plane, the latter being referred to 
as the constraint plane. F, which is a quadratic form in only two variables, 
is spoken of as an abridged form of F. 

'rhe abridged form is not always so easily found as the very simple 
situation just described. Thus, with reference to the same three-dimen- 
sional form F, suppose that the constraint plane is chosen as an arbitrary 
one, still passing through the origin of co-ordinates, however. In this case 
the equation of constraint is changed from th< simple form 0:3 = 0 to 

Pzi^'i + PZ2X2 + ^33^3 = 0 [144] 

II Paij A32, A33 are regarded as components of a vector p^, and Xi^ X2, 
as those of a vector x, Eq. 144 demands that the vector x be orthogonal 
for the vector p^, which, in the simpler case considered above, is coincident 
in direction with co-ordinate axis number 3. The constraint plane is 
normal to the vector ^3, and may be fixed at will through an appropriate 
choice of this vector. 

The abridged quadratic form F is such a function of two variables that 
F = 1 becomes the equation of the elliptic intersection of the original 
ellipsoid with the chosen constraint plane. Since the variables involved in 
F must refer to a two-dimensional orthogonal co-ordinate system Ijdng 
within the constraint plane, it is clear that these variables are not simply 
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two of the original ones, 'lb find F one must first determine a new set of 
orthogonal co-ordinate axes l', 2', such that one of these, say axis vV, 
is coincident with the vector />a (normal to the constraint plane). The 
new axes and 1* will then lie in the constraint plane; and in terms of the 
new variables Xi, X 2 , referring to the new axes, it becomes clear that 
the constraint 144 is expressed by the simple relation — 0. 'fhe problem 
is thus reduced to the simple form considered above. The abridged form 
F is found by first subjecting the original variables in F to the orthogonal 
transformation appropriate to changing from axes 1, 2, 6, to axes l', 2', 3', 
and then dropping all terms involving x.i. 

The crux of the procedure lies in finding the proper axes 1', 2', 3', and 
from these the appropriate orthogonal transformation. Since axis 3' is 
coincident with the vector p-^ defined by the constraint Eq. 144, the 
problem is essentially that of associating with this vector />;i, two other 
vectors pi and pi such that the three together form a mutually orthogonal 
set. To carry out this procedure, one would begin by finding first a vector 
po normal to and then a vector Pi normal to the other two. Since 
there exists an infinite number of vectors pi normal to /> 3 , it is clear that 
the procedure as a whole is not unique, and there exists an infinite number 
of functions F of which any one may appropriately be called the abridged 
form of F for the stated constraint. 

The details of the procedure just described are best made clear through 
a numerical example. Suppose one has given 

E = -f- 2x2^ + 3x3^ [145] 

and the linear constraint 

+ ^2 + X3 = 0 [146] 


Choosing F in the form of a sum of squares does not detract from the 
generality of the procedure to be discussed here. 

The constraint vector p^ has the components [1, 1, 1]. A procedure for 
finding vectors pi and pi such that the three vectors form a mutually 
orthogonal set may be patterned after the methods discussed in Art. 7, 
Ch. III. In this way the following matrix is readily found : 


= 


1 - 12 “ 
1 ~1 0 

1 1 1 


[147] 


the components of the vectors pi, pi, ps being defined, respectively, by 
the elements of the first, second, and third rows. As already mentioned, 
the determination of this matrix is not unique. It may readily be 
checked by inspection that the three rows of f? define a mutually orthogo- 
nal set of vectors. 
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Through dividing the comixjiients of each vector by the square root 
of the sum of the squares of its components (called normalization) one 
obtains a corresponding set of unit vectors, whose matrix is orthogonal 
(defines mutually orthogonal unit vectors by columns as well as by rows). 
This orthogonal matrix, which reads 


0 


-0.408 - 0.408 0.817 

0.707 -0.707 0.0 

0.577 0.577 0.577 


[148] 


yields the desired transformation from the co-ordinates Xi, JC 2 , to 
^ 1 , ^ 2 , ^ 3 , thus 

0 Xx] = x] [149] 


Hence, according to discussion earlier in this chapter, the quadratic 
form in terms of the new variables has the matrix 


a = 0 XQ X Ot 


where (? is the matrix of the given form. 

In the numerical example considered here, one has 


a = 6 X 


”10 0 
0 2 0 
0 0 3 


X Ot 


[150] 


[151] 


and with Eq. 148 this yields 


Cf 


2.5 

0.2885 

0.707 


0.2885 0.707 

1.5 -0.408 

-0.408 2.0 


[152] 


Deleting the third row and column, one has the matrix of the desired 
abridged quadratic form, which reads 

F = 2.5Ji‘ -f 0.577 xiX 2 -f- 1.5x2^ [153] 


In a more general case in which the given quadratic form F involves 
any number of dimensions (say n) and t < n linear independent con- 
straints are specified by the equations 

Pr\X\ + pr2X2 + * * ' + Prn^n = 0 

pr-{.\,\Xi -f ,2-^*2 "h ’ * * + = 0 [154] 

pnlXl -(- Pn2^'2 -f- • • • -f* pnnXn = 0 

in which r = w — / + 1, one has given t vectors pry with 

comix)nents defined respectively by the coefficients in these equations 
taken by rows. The / planes orthogonal to these vectors are the constraint 
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planes, and the desired abridged quadratic form should have its variables 
confined to lie in the intersection common to these planes. 

Geometrically it is now convenient to think of the / independent vectors 
pr, pr+\^ • • • />n as occupyiiig a /-dimensional subspace immersed in the 
/^-dimensional one. The common intersection of the constraint planes 
then occupies a (/i — /)-dimensional subspace, and these two subspaces 
are mutually orthogonal to one another. One must find a new orthogonal 
set of co-ordinate axes such that the first (;/ — /) of these lie in the 
{n — /)-dimensional subspace, while the remaining / of them lie in the 
/-dimensional subspace occupied by the vectors pr^ pr+\, - ‘ ' pn- In this 
new co-ordinate system the constraint equations 154 become simply 
Xr = = ■ • • = x„ = 0, and the variables xi, • • • ^n-t must lie in 

the common intersection of the constraint planes because these lie in the 
(n — /)-dimensional subspace. 

The first step in the process of finding an aj)propriate set of new co- 
ordinate axes is to determine a mutually orthogonal .set of / vectors that 
occupy the same subspace as the constraint vectors pr. pr+iy • • • pn- If the 
latter happen to be mutually orthogonal, this first step is already done 
(as may be said of the single constraint case discussed above), but in 
general the constraint vectors are any independent set, not necessarily an 
orthogonal one. 

A mutually orthogonal set of vectors occupying the /-dimensional 
subspace is any set of / mutually orthogonal vectors each of which is 
expressible as a linear combination of the given constraint vectors. Such 
a set may be formed in an infinite variety of ways. One may begin, for 
example, by choosing pr as the first of the desired .set. Then one may 
determine a second vector, orthogonal to pr, and expressed as a linear 
combination of pr and />r+i- Next one determines a third vector through 
demanding that it be orthogonal to the first two already found, and 
expressed as a linear combination of pry pr^iy pr+ 2 y and so forth. 

Specifically this process may be indicated as follows, letting the desired 
mutually orthogonal vector set be denoted by pr, pr+i, * * * pn- To begin 
with, one chooses pr = pr- Next 


Pr-\-\ — OlpT + PPr-\-l 

pr • PrJfl = Otpr * Pr + Ppr ‘ pr^l = 0 


[155] 


in which a and p are nonzero. Here P may be chosen arbitrarily. Thus 
letting = — 1 gives 


pr * 

a = 


Pr h 


[ 156 ] 
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and thus prJ^.\ is found. Then one writes 

pr\2 — «Vr + ii^' pHi + y^pr+2 

Vr • Pr+2 = 0 [157] 

Pr-fl • Pr4-2 = 0 

in which y may be chosen at will and the last two equations solved for 
a and thus determining prAri- 

In the next step one will have to solve three equations simultaneously 
for three unknown coefficients, etc. If the number of constraints is large, 
the computations may become tedious, but they remain straightforward. 
There are, of course, other ways in which the desired set of t mutually 
orthogonal vectors Pr • • • pn may be found, the method given here being 
rather simple in principle and no more tedious computationally than any 
other. 

One must now associate with these / vectors, n — t additional ones so 
as to obtain a complete set of n mutually orthogonal vectors pi ■ ' pn- 
This second step is carried out as in the single constraint case. The 
directions of these n vectors are those of the desired co-ordinate axes. 
The orthogonal transformation from the original co-ordinates to these 
new co-ordinate axes has a matrix 0 whose vector set is pi |pi |, p 2 ^\p 2 \^ 

• • ' pn '\pn\\ that is, a set of unit vectors coincident with pi • • Pn- 
Equation 149 expresses the co-ordinate transformation, and Eq. 150 
gives the matrix of the quadratic form F in the new variables Xk in terms 
of the matrix of F for the original variables Xk- In terms of the new 
variables 

F = ^ X a X x] [158] 

and the arbitrary linear constraints, expressed in terms of the original va- 
riables by Eqs. 154, are simply Xr = Xr+i - • • = x,, = 0. The desired 

abridged quadratic form is that part of F remaining after terms involving 
the last / variables are discarded. 

Perhaps it might be well to restate what has been done above in another 
way. Specification of the linear constraints 154 is equivalent to demanding 
that the vector x with components Xi • • • x« is no longer free to assume 
any orientation in space, but is required always to be simultaneously 
orthogonal to t arbitrary and independent vectors • pn- Now these / 
vectors occupy only t of the n available dimensions of the given space. 
Hence, so long as the vector x moves about so as to stay outside the /- 
dimensional subspace occupied by the constraint vectors, it will fulfill 
the stated orthogonality restriction. For example, for » = 3 and / = 1, 
the vector x is free to move in a plane normal to the single specified con- 
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straint vector; for = 3 and t = 2, the vector x is restricted to the one 
remaining dimension, defined as that direction in space which is orthog- 
onal to the plane (/-dimensional subspace) determined by (or occupied by) 
the two constraint vectors. 

When n is larger than 3, one must be mentally sulViciently adaptive in 
one’s thought to comprehend the analogous geometry implied by con- 
tinuing the identical algebraic reasoning beyond « = 3. For example, with 
w = 4 and / = 2, one must visualize a two-dimensional plane determined 
by the two constraint vectors as defining a corresponding two-dimensional 
subspace, and recognize that there are two other dimensions left over 
which define another subspace (the n — t dimensional one in this case) 
orthogonal to the plane determined by the constraint vectors in the same 
sense that if the vector x remains in this second two-dimensional subspace, 
it remains simultaneously orthogonal to the stated constraint vectors. 

Stated in terms of constraint planes, which are normal to their respec- 
tive constraint '/ectors, one must recognize that the {n — /)-dimensional 
subspace to which the vector x is restricted should be interj^rcted as a 
resultant common intersection of these planes. Again for n = 3 and 
/ = 2, the two constraint planes intersect in a line, so that only one 
dimension is left for the vector x to exist in. However, for n = 4 and 
/ = 2, the “ intersection ” of the two constraint planes becomes a two- 
dimensional subspace. In general, one must visualize the possibility of / 
planes in an ^-dimensional space as possessing a common intersection 
which is {n — /)-dimensional, the latter defining a subspace orthogonal 
to the one occupied by the constraint vectors. 

What one is asked to do in the abridgment process is to find n mutually 
orthogonal unit vectors in the ^-dimensional space such that / of these are 
linear combinations of the constraint vectors. These / vectors then clearly 
occupy the same subspace as the constraint vectors. Since the remaining 
(n — /) vectors are simultaneously orthogonal to the first /, they must 
be simultaneously orthogonal to the constraint vectors, and hence they 
define the {n — /)-dimensional subspace in which the vector x can move 
and still conform with the restriction that it be orthogonal to all the 
constraint vectors. 

If one chooses new co-ordinate axes coincident in direction with this 
set of mutually orthogonal unit vectors, the matrix of the linear trans- 
formation, expressing the co-ordinates with respect to the new axes in 
terms of those relating to the original axes according to Eq. 149, is clearly 
that orthogonal matrix having these unit vectors as its vector set. In 
terms of the new co-ordinates the original constraints are equivalent to 
setting equal to zero those variables that refer to the axes lying within 
the subspace occupied by the constraint vectors, because these simplified 
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constraint equations clearly demand no more nor no less than the original 
ones. 

While there exists an infinite number of possible sets of new co-ordinate 
axes fulfilling the conditions just stated, note that the two subspaces 
defined by such axes are unique. For example, with w = 3 and i = 2, any 
co-ordinate system of which two axes lie in the plane determined by the 
constraint vectors (the /-dimensional subspace) is acceptable. This plane, 
however, is fixed, and so is the remaining direction normal to it (the 
n — / dimensional subspace) no matter which one of the infinite possible 
choices one makes in determining a specific set of new axes. 

In summary one may say that the discussions of this article show how 
one can convert an arbitrary set of t linear constraints, such as those 
expressed by the Eqs. 154, into an equivalent set having the simple form 
of demanding that / of the variables be zero. The latter will, of course, 
not be the original variables but new ones related to the original ones by 
an orthogonal transformation. In the next article the object is to in- 
vestigate the elTect of applied constraints upon the latent roots of a quad- 
ratic form. Since these latent roots are unchanged if the variables in the 
quadratic form are subjected to an orthogonal transformation, the form F 
has the same latent roots when it is expressed in terms of the new variables 

• • • Xn as it does when it is expressed in terms of the original variables 

• • • Xn. One may therefore say that the effect upon the latent roots of 
F caused by imposing an arbitrary set of / linear constraints may be 
studied without loss in generality by considering only the simple case in 
which the constraints have the form expressed by setting / of the variables 
equal to zero. 

15 . The effect of constraints upon the latent roots of a 

QUADRATIC FORM 

In this discussion only the absolute valu.es of the latent roots are of 
interest. When the quadratic form is positive definite, these roots are all 
positive real numbers, and there is no need to emphasize specifically the 
fact that only their absolute values are to be considered. It is only in the 
more general case in which the latent roots may have negative as well as 
positive real values that such a distinction is necessary. However, since 
the discussion in Art. 10 regarding the iterated forms shows that the 
iterated form of even order can have only positive latent roots (which are 
those of the original form raised to an even power), it is clear that, 
although the present discussion be restricted to the consideration of 
positive definite forms, the conclusions reached apply equally well to the 
magnitudes of the latent roots in the general case. 
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In Art. 5 it is shown that the latent roots Xi, X 2 > * * * K of the form 

P' = h dikXiXk [159] 

may be regarded as the extrema of f subject to the restriction of the XkS 
expressed by 

i .V = 1 [160] 

^=1 

The correctness of this result may be made e\ident through considering 
the form F reduced to the normal form 

F = XjJi" -|- ^ 2 X 2 '' + • * * + ^nXn" [161] 

by means of the orthogonal transformation 

.v] = £J] [162] 

involving the modal matrix £. Because of the orthogonality of the latter 
transformation, the condition 160 is unchanged in form, that is, 

i A- = 1 [163] 

f: = 1 

In terms of F as given by Eq. 161, and the condition expressed by Eq. 
163, it is clear by inspection that the latent roots are extrema of F. More 
specifically, if the roots are numbered in such a way that 

X, > X. > X;, > • • • > K [164] 

one may see that the largest value of F for the condition 163 occurs for 
= 1 , X 2 = = • • • = Jr, = 0, and equals Xp The next largest sta- 

tionary value of under the same condition occurs for = Ir Ji = J.j = 

• ' • = Xn = 0 , and equals X 2 ; and so forth. 

If the variables Xi • • Xn in F are subjected to an arbitrary set of / 
independent linear constraints, the resulting abridged form F in n — / 
variables has n — t latent roots which may be numbered so as to conform 
to the sequence 

> ^2 > ^3 > ■ • • > K-1 [165] 

It is the object of the following discussion to establish relations between 
the magnitudes of the latent roots of F and those of F. 

The constraint equations, which are assumed to have the form 

Xi = X 2 = X 3 = • • • = = 0 [166] 

are alternately expressed in terms of the variables Xk (related to the Xk 
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through the orthogonal transformation 162) as 

n 

T. = 0 (5 = 1, 2, • • • 0 [167] 

k=\ 

in which the Ink are elements of £. 

Although F, subject to the linear constraints, has latent roots that are 
(lifTerent from Xi • • • Xn, nevertheless the expression 161 for F may be 
used to compute values of F corresf)onding to any values of the variables 
5i • • • and hence yields values for the abridged quadratic form pro- 
vided only that the assumed values for conform to the constraint 

equations 167. If in these equations one chooses to let XtJ ^2 — = 

• • • = Xn = 0, there results a set of / equations in / + 1 unknowns, which 
surely possess a nontrivial solution for xi • • in agreement with the 
condition 163. For such a set of x^-values, Eq. 161 yields 

F = XiXi*" + XoXo*" + • • • + X/_j_iX/_^i“ ^ X/^i(xi‘' + • • • = X;_j_i [168] 

as mav be appreciated by noting the inequalities in 164 and the condition 
163. 

Since F, subject to the linear constraints 166 or 167, is the abridged 
form f\ the result 168 shows that a possible value of the abridged form is 
at least as large as the latent root X/^.! of the corresponding unabridged 
form. If the maximum-minimum problem discussed in Art. S is applied 
to the abridged form I\ one observes that the maximum value of F, 
subject to a condition in terms of its variables similar to 160 or 163, is 
Xi. The next largest stationary value is X2, and so forth. Inasmuch as it 
has been shown that a possible value for F is at least as large as Xf_|_i, it is 
clear that the largest value Xi is surely as large as this, that is, 

Xi ^ X,+i [169] 

Suppose now that v additional constraints ire imposed upon F, making 
t + V constraints in all. The resulting abridged form, which may be 
denoted by F, can also be regarded as resulting from imposing the set of 
V constraints upon F. If the latent roots of F are denoted and numbered 
according to the sequence 

Xj > X2 ^ X3 > • • • > X^_f_y [^^^] 

.then one may write two additional relations similar to 169 which read 


Xl ^ Xi^>yu|.l 

[171] 

Xl ^ \v-\-l 

[172] 


In any one of the relations 169, 171, or 172 the equals sign holds only 
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if the constraints are chosen in a particular way. Thus for a particular 
form of the v constraints, the equals sign in 172 may be assumed to hold, 
but it will not simultaneously hold in 171 also as long as the original / 
constraints are considered to be arbitrary. Thus the relations 171 and 172 
are seen to yield 

Xi4-1 ^ [173] 

which establishes relations between all the latent roots of F and a like 
number of those of f , since the integer v can have any value from 0 to 
— 1 . 

Considering again the interpretation of the latent roots Xi • • • K-t of 
F in the manner that those of F are interpreted in Art. 5, bearing in mind 
that F is the result of imposing certain constraints upem F, and hence that 
all the extrema of F are smaller than the respective ones of F, which are 
attainable only if its variables are free to assume particular sets of values, 
one recognizes the following additional relation as being true: 

X, ^ X, (s = 1.2, • • • « - /) [174] 

Together with 173, one may summarize the results so far in the form 

X. S X, S X.+, [175] 

in which the index s may be given the integer values 1, 2, •••«—/. A 
useful relation between the n — / latent roots of F in terms of the n 
latent roots of F is thus established. 

A case of particular interest is that in which a single constraint is im- 
posed upon F. The latent roots of F and F are then related as expressed by 

Xi ^ Xi ^ X 2 § X 2 ^ X 3 s ^ X„_, ^ X„ [176] 

These inequalities are sometimes referred to as expressing the separation 
property of the roots of F and F. 

An interesting geometrical view of this last result is had through 
visualizing the ellipsoid associated with F, and the constraint plane as 
slicing the ellipsoid centrally at an arbitrary angle. The intersection of the 
ellipsoid with the plane yields an ellipse with principal axes whose lengths 
are clearly intermediate as compared with those of the ellipsoid. 

With reference to the pair of forms F\ and F 2 , discussed in Arts. 1 1 and 
12, it may be pointed out that the results of the present article apply also 
to the roots and those of an abridged pair of forms resulting from 
subjecting their common variables Xi • • * Xn to an arbitrary set of 
independent linear constraints. 
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PROBLEMS 

1. Write down the matrices of the following quadratic forms and determine whether 
or not they are singular. 

SXi^ - SXiXi + 3X1^3 4- 4XiJC4 + 2.^2“ + ^X2Xz - 2X2^4 + 5X3^ + 2iC3:C4 + 6X4^ 

F2 = 4 - 3^10^2 4 TxiXj + 9x2X1 4 9x2^ — 3x2x3 — 3x3X1 + 5:c3JC2 4- ^ 3 ^ 

2. Show that the matrix of a quadratic form can be written as 



respectively with 



4. For what type of transformation matrix is the collineatory transformation 
identical with a congruent one? Out of the following matrices, pick by inspection 
those that have this property: 



Check your selection by carrying out the transforn .ttion upon the matrix 



5. Compute the latent roots and proper values of the following quadratic forms: 

Fi = j'5[83.ri- 4 52 .x*iX 2 — 40.viJr3 -f 122.V2” + 20.V2af3 + 65x3"] 

F 2 = + 24jci.V 2 + i 2 O.v 1 .t 3 + 39.V22 - 6 O.V 2 X 3 + 30x3^] 

F 3 = :|',i[49vr - ^xiX2 - 80ariX3 -h A6x2^ + 40x2.V3 4- 40ac32] 

F4 = + 2V'^.ViX2 - X2^ 

6. Find the lengths of the principal semiaxes of the ellipsoid represented by the 
quadratic form: 

F = 177.vi“ 4" 228.vijr2 120 jvi.V 3 348.V2" 4 b0.r2a‘3 4 IO 5 .V 3 " = 45 
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7. Compute the modal matrix of the quadratic form given in Prob. 6 and make an 
isometric plot showing the positions of the principal axes. 

8. Show that the quadratic surface 

2jri‘ -h 5.1*2* - b.r2.V3 + 5.V3- = 2 

represents an ellipsoid of revolution. Compute the lengths of the .semiaxes and find 
their direction cosines with reference to the given co-ordinate system. Write down the 
modal matrix. Is it unique? 

9. Reduce the following quadratic forms to sums of squares through orthogonal 
transformation and give the respective transformation matrices: 

F\ = v 3.5 a*i- — 2.V 1.1*2 + .V1.V3 4- -h O.4.V2.V3 -h 1.9.V3“ 

^2 = - 284.vi.vo - I88.V1.V3 + 181.r2“ - 256.r2a:3 - Sx^^] 

Fz = — 2.6.V1' -f 2.4.V1.V2 + O.6.V2 

Fi = 0.4 (.vr 4- \ J.vi.vi 4- 2.r2-) 

10. Reduce the following quadratic term to its normal form: 

F = 0.487.V1- — 0.784.V1.V2 4- 0.632.vi.V3 4- 0.445.vi.V4 4- 0.487.V2- 4* 

4- 0.445.va.V4 4- 1.892x32 -f O.IIIX42 

What is its rank? 

1 1 . Find the orthogonal matrix which transforms 

F = 0.392xi' - 0.948.ri.r2 - 0.632.viX3 4- 0.444xi.r4 -f 0.392.r22 — 0.632x2.r3 

- O.444.V2.V4 4- 1.892x32 4*0.111x42 

to the normal form. Find the latent roots and the signature. 

12. By forming the Gramian determinant, establish the linear dependence or 
indep>endence of the vector set defined by the row matrices: 

[1 3 4 5 1 3] [2 3 4 5 6 7] [1 6 8 10 -3 2] 

Repeat for the set 

[1 -1 4 8 2] [3 -3 -2 -10] [4 2 5 8 -8] 

13. By means of elementary transformations find a matrix S which will reduce to a 
sum of squares the quadratic form 

F = 4xi2 4- 5xi.r2 ~ 2^1X3 + 7 x 2 ^ -f Jx^xs -f 2x32 

through a congruent transformation of its matrix Q. Determine the corresponding 
diagonal matrix Q)i and the signature. 

Through a similar procedure find another matrix which also yields a reduction to 
a sum of squares. Express the relation between the two diagonal forms 5D and 3)' of 
the reduced matrices by computing the diagonal matrix A in the equation 
3)' - At3)A. Show that it is always possible to find such a real A relating the alternate 
diagonal forms of a symmetric matrix. 

14. Using the following vector set, 

[1 1 -1] [-3 3 5] [-5 0 5] 
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form a matrix Q whose elements are computed as are those in the Gramian determi- 
nant. Find the latent roots of Cf and check the positive character of the principal 
minors of its determinant. 

15. If possible, reduce each of the following quadratic forms to its canonic form 
and give the corresponding modal matrix as well as the transformation required to 
pass from the normal to the canonic forms. 

Px = [ 83 xi 2 -f 52 .riX 2 - ^Oxix^ -f 122.^2“ -f- 20.r2:c3 + 65 x 3 ^] 45 -^ 

Fi = [49x1“ — 4xi.r2 — 86 x 1 X 3 -f 46x2“ + 46 x 2 X 3 -f 46x32]45~^ 

16. Determine whether or not the following quadratic form is positive definite: 

F = v?xi“ - 4 xi.V2 - 4x1X3 + 5x2“ + 2x2X3 4- 4x3^ 

17. Find the triangular matrix that generates the matrix Q of the quadratic 
form 


F = X 1 “ — 6x 1X2 - 2x 1X3 -|-4x 1X4 4- 13x2“ 4“ 16x2X3 — 26x2X4 4- 1 1x3“ - 14x3X4 4-25x4^ 

and check the relation Cf = 

18. (a) Find the extrema of the quadratic form 

F = :S[ — 7xi“ — 16 \'^xi.X2 4 - 3x2“] 


when the variables xi, X2 are subjected to the condition that the point jP(xi^ 2) shall 
lie on the unit circle. 

(b) Find the extrema of 

F = -^[4x1“ 4- 4x1X2 f 4x1x3 4“ 149x2“ - 278x2X3 4* 149x3^ 


subject to the condition that P(xi,X2,X3) shall lie on a sphere of radius 2. Determine 
the signature of this quadratic form. 

19. I^t G be the matrix of a positive definite quadratic form, .4 its discriminant, 
and 9 a triangular matrix which generates G according to the relation G = 

Show that a possible procedure for the formation of is given by the relations: 
'Pkj ~ 0 for /fe > y ; pKj ^0 for k ^ 7. 


pij = ± 

pkj = ± 




p2j = ± 


Bli 


V ail 
v7i ...^A:_l)7Cl...Af 


\ aii5i2 
Pnn ~ 


pZi =dr 


Ci2; 


V -B12C123 


ViVfl...(n-l) 


Here ai, are the elements of the first row of G. The Bi, are second-order minors 
formed from the first two rows of G by selecting the first and the 7th columns. The 
Ci2y are formed from the first three rows of G by selecting columns 1,2, and 7; and so 
forth. 

26. By using the procedure given in the preceding problem compute a triangular 
matrix which generates the matrix of the quadratic form given in Prob. 17. 

21. Show through a proper expansion of the characteristic determinant of a 
matrix that its characteristic equation can be written in the form 
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(— X)" + (— X)”"' 2 (principal diagonal elements) 
+ ( -X)"”- 2 (principal minors of order 2) 
+ ( — X)"“^ (principal minors of order 3) 


— X2 (principal minors of order » — 1) 

-1- determinant = 0 

In view of this result, show that the sum of the latent roots of a matrix equals the 
sum of its principal diagonal elements, and that the product of the latent roots equals 
the determinant. Show further that if the determinant is zero, at least one latent root 
is zero; if the matrix is of rank n — 2, at least two latent roots are zero; and so on. 

22. Using the result of the previous problem, show that the roots of the equation 
X** + Un-iX"-^ 4- . . . 4- jjX 4- <io = 0 are the latent roots of the matrix 


0 

1 

0 

0 • 

• 0 

0 1 

0 

0 

1 

0 •• 

• 0 

0 

0 

0 

0 

1 •• 

• 0 

0 

0 

0 

0 


0 

1 

Jo 

~Jl 

-(12 

• • • 

. . . 



23. Let Q be an «th order square matrix and Xi, Xj, • • • Xn its latent roots. The 
mth power of (J is denoted by 5 j 6. 

(a) If the latent roots are real and distinct, and numbered so that |Xi| > |Xj| > 
• * * > iXnl, show that for a sufticiently large w: 

|Xih ^ 2 (principal diagonal elements of B) 

!XiX2i”* ^ (principal minors of order 2) 

1X1X2X31’" ^ 2 (principal minors of order 3) 




(b) If the latent roots are real but there are a repeated roots so that 
iXi| > IX 2 I > * • • > [Xj,| > |Xj;^l| = |Xp4.2l = • • • = |\^fa| > |Xj;^a+l| > * * • > |Xn| 


derive the relations 


|X,|"s 


(principal minors of order j) 
2 (principal minors of orders — 1) 

(principal minors of order g -f 1 ) 
a 2^ (principal minors of order g) 


y = l,2, • • •g;g-fa-fl,g-ha+2, - • 


(c) Suppose, in the sequence |Xi| > IX2I > • • • > |X„|, that the first h - 1 roots 
are real but that the roots \h and Xa+i are conjugate complex with the angle 0*. In 
this case show that 


lx*!*" 


21 (principal minors of order A + 1) 
2^ (principal minors of order /r — 1) 


and 


^ . T) (principal minors of order h) 

2 X* cos m^H ^ 7 — ^ — 7 tt 

‘ ' 2- (principal minors of order h - {) 
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24. If F is a positive definite quadratic form in n variables, show that its character- 
istic determinant, when expanded as a polynomial in X, consists of w -f 1 terms with 
alternate algebraic signs, and that the same polynomial with all terms alike in sign is 
associated with a negative definite form. 

n 

25. Consider the quadratic surface aaXiXk = 1 and the plane 

^ AkXh = 0 
A =1 

(a) Show that the intersection of the plane with the surface can be expressed as a 
quadratic surface of n — 1 dimensions in the form 

2 ) br»x'rx\ = 1 
r.Ji =1 

and compute the values of the coefficients hrt. 

(b) As a numerical illustration find the equation of the ellipse given by the inter- 
section of the ellipsoid of revolution 

Xi^ . X2- + Xi- , ^ ^ , 

—5- H = 1 wnth a = V 2, & = 1 

a* 6- 


and the plane 


— 2xi -h 3 x 2 + X3 = 0 


Find the lengths of the semiaxes of this ellipse and compute their direction cosines. 

26. Given the quadratic surface 

n 

2 ^ QikXiXk — 1 
' i.k^l 

and the parametric equation of a straight line 

X/ ^ajt + Pi y = 1,2, ---n 

in which a, and jS, are constants and / is a variable parameter. 

(a) Demonstrate that the line cuts the surface at most in two points and that the 
corresponding parameter values must satisfy the eq’.ation 

( 2 + if / = 1 — 2 ^ dikPSk 

Discuss the conditions under which there are 2, 1, or no points of intersection. 

(b) Compute the co-ordinates x\ and x"* {k = 1, 2, • • • n) of the points of inter- 
section. 

(c) If the straight line passes through the origin, show that 


x\ 


— X * = 


Otk 



aikctm 


and interpret this result according to whether the quadratic form is positive definite or 
not. 

(d) Write down the corresponding expressions appropriate to a quadratic form 
representing a cone. 
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27. Carry through the simultaneous reduction to the normal form of both quadratic 
f >rms in the following pairs: 

(i) Fi = 2xr + 2.V1.Y2 + xr F 2 = xr + 2 xiX 2 - X 2 ^ 

(ii) Fi = 2xr 4- 2xiX2 -f 2.t 1X3 -h 3.r2- - 2x2X3 + 2x3* 

Fa = xi^ + 3x2^ - 2x2.r3 4- 2x2“ 

(iii) Fi = 2x1" 4- 6x1X2 4- 5x2“ F 2 = 2xiX2 

28. Compute the matrices and fP (see Eq. 101), and the matrices 

and for each of the three reductions in Prob. 27. 

29. Let a 11X1- 4- U22.Y2‘ = 1 be the equation of an ellipse when referred to a system 
of oblique axes making an angle <f>. 

Show that the principal axes of the eDipse make angles a i and aa with the xi-axis 
that are determined by the relation 

(m^ — 1) ± — 1)“ + 4w^cos“0 

tanai.2 = 

(m- 4- 1) :t V (w“ + 1)^ — 4m" sin*0 


in which m == un '<222; and that these axes can have any desire<l orientations through 
the proper choice of the ratio m. Ob.serve that the angle betwt^en the axes is 90®. 

Write down the expression for a family of ellipses for which one principal axis 
coincides with the x^axis. 

30. Consider the pair of quadratic forms: 

Fi = 3x,* - iv'lxixt + 2x1* and Ft = xi* + 2\/2xixt - 2x,* 

Regard the variables xi and X2 as the contravariant co-ordinates of a point F referred 
to a system of axes for which the fundamental metric tensor has the matrix 

!] 

(a) Make a plot of the curves Fi = 1 and F2 ~ 1 in this system of reference 
co-ordinates. 

(b) Find the transformation of variables which simultaneously reduces both 
quadratic forms to their normal forms. 

(c) Find the orientations of the new co-ordinate axes relative to the old ones. 

(d) Compute the transformed fundamental matrix @. 

(e) Find the orientations of the principal axes of the resultant ellipse with respect 
to the new co-ordinate system. 

31. Consider the pair of quadratic forms 

Fi - xi* — 2 xiX 2 4- 2x1X3 4- 2x2* — 2x2X3 -f 3X3® 

F2 = 3 xi® — 2x1X2 4 - 2x1X3 — X2* ~ 2x2X3 4 “ 

in which xi, X2, X3 are contravariant variables with respect to an oblique co-ordinate 
system consisting of a triad oi axes whose mutual angles are all 60®. 

Reduce both Fi and F2 to their normal forms through a transformation to a new 
co-ordinate system and find the orientations of the new co-ordinate axes with respect 
to the old ones. 
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32. Find the equation of the intersecting curve between the quadratic surface 

83jci2 + S2xxX2 - 40:ci.T3 + Mlx^^ + 2Qx^^ + 65x8^ = 45 
and the plane 

5.ri + 4.V2 - Sjca = 0 

and compute the lengths of the semiaxes of this curve as well as their orientation with 
respect to the assumed rectangular Cartesian co-ordinates. 

33. For the ellipsoid given in Prob. 32 find equations of the three planes containing 
the principal axes taken in pairs. 

34. Find the maximum perpendicular distance from points on the ellipsoid of 
Prob. 32 to the plane 

— 2^1 d" ^2 “h 2xz — 0 

35. Given 

F = i’rJ21.vr -f 24xiX2 -f i2O.r1.v3 + 39jr2- - + ^Ox^^] 

Find the abridged quadratic form that results if the variables are subjected to the 
condition that they determine a point which is constrained to lie on a plane normal to a 
vector with the components 1, 2, —1, and compute the latent roots of this abridged 
form. 

36. The rectangular Cartesian co-ordinates forming the reference system for the 
ellipse 

3x1^ + 2x2^ -h == 1 

are subjected to a transformation to a new set of axes defined by the orthogonal vector 
set 


1 

1 

U 

■ 2 1 

-2 “ 

To ^ 

v/sl 

L\'5 3\5j 

.V5 3 

” 3V^5_ 

r 3 

3 _ 


Find the equation of this ellipse in terms of the new variables and make plots in the 
new co-ordinate planes of the intersections of the ellipse with these planes. 

37. The real quadratic form 

n 

F = 2 f^ikXiXk 
i,k=l 

is subjected to the set of / linear constraints 

L prkXk =0 r = 1, 2, • • • / 

Jk-l 

and the condition 

I 

In terms of the function • 

F = G= F+x('i - J) Xjt a + 2 2 £ t^rprkXk 

\ ]l=l / r=lfc=l 

in which X and mi * * * are Lagrangian multipUers, show that the conditions fer >n 
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extremum of F subject to the constraints lead to the equations 

n t 

Xl (iikXi - \xk + E MrM =0 = 1, 2, • • • n) 

1=1 r=l 

E prkXk =0 (r = 1, 2, • • • /) 

it = i 

t X.2 = 1 

A=1 

Show that this system of equations is sufficient to determine the unknowns 
.vi • • • .r„, Ml ‘ * • and that the extremum of F is given by f = X in which the 
X-values are roots of the equation 


(till — X) 
<121 

tl 12 

(<122 — X) • 

In 

U2n 

pll • 

Pl2 ‘ 

• pn 

• Pt2 


Onl 

Un2 

(Ufift X) 

pin • 

• pin 


Pll 

P\1 

pin 

0 • 

> 0 

= 0 

P 2 I 

P 22 * * 

p2n 

0 • 

• 0 


Pii 

Pt2 

Pin 

0 • 

• 0 



38. Show that the determinant given in Prob. 37 furnishes the same latent roots as 
those corresponding to the abridged quadratic form. Following a method similar to 
that given in Prob. 21, obtain expressions for the coelVicients of the various powers of 
X in this characteristic equation. 

39. Illustrate the proceflure outlined in Prob. 37 with the quadratic form and the 
single constraint given in Prob. 32 and compare the results with the solution to that 
problem. 
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Vector Analysis 

1. Preliminary remarks and definitions 

The quantities considered in this chapter are functions of the co- 
ordinates of ordinary (three-dimensional) space. In some cases, they also 
may be functions of other independent variables such as the time. 

A scalar is a function which, for each set of values of its independent 
variables, is completely characterized by a corresponding magnitude. 
If the function is defined for all points within a given region, it is there 
said to constitute a scalar field. Potential functions, such as the scalar 
potential in an electric field or the thermodynamic potential of an ideal 
gas, are common examples. The geographical altitude as a function of 
latitude and longitude is a two-dimensional example of a scalar field. 

A vector is a function which is characterized at each point in space by 
means of a magnitude and a direction. If the function is defined for all 
points within a given region, it is there said to constitute a vector field. 
The earth s gravitational field of force or the velocity field of a fluid are 
familiar examples. The magnitude of a vector function is a scalar. The 
vector function may, therefore, be thought of as a scalar to which a 
direction is assigned at each point in space. 

More specifically, however, two kinds of vector functions are dis- 
tinguished according to their processes of derivation. Thus, for example, 
the gradient of a scalar potential function is a vector. A simple example is 
the gradient in a mountainous terrain. A v'ector of a physically different 
nature is that used to represent a mechanical torque. The torque is pro- 
duced by a force acting upon a lever arm, a? i the resulting vector (by 
convention) stands normal to the plane determined by the force and the 
arm; that is, it coincides with the axis of rotation. The direction of the 
torque vector, moreover, must be defined in accordance with a right- or a 
left-hand screw rule. 

These two types of vectors, such as a gradient and a torque, are dis- 
tinguished respectively by the adjectives polar and axial. This distinction 
is not merely a superficial one which may be disposed of by the simple 
process of propounding a pair of suitable adjectives. One reason for 
making such a distinction is brought to light when the variables of these 
vector functions are subjected to a co-ordinate transformation, such as 
changing from a right-hand to a left-hand system of rectangular axes 
(see Fig. 1). In this case, the algebraic sign of the axial vector function is 
reversed and that of the polar vector is not. If both types of vector func- 
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lions arc involved in a given problem, this circumstance must be care- 
fully considered. 

An axial vector may be the result of a vector product formed from two 
given vectors. It must be observ’ed, however, that this is the case only if 
both the given vectors are either axial or p>olar. The vector product 
formed from a polar and an axial vector is polar. 

Since a scalar may be the result of a scalar product of two vectors, it 
appears that this question regarding the distinction between two types 


z 




left-haxid cartesian axes 


Fig. 1. Two systems of cartesian a.xes. 


of vectors is not confined to vector functions. Thus a scalar function 
which results from the scalar product of a polar and an axial vector has 
different mathematical properties from those of a scalar function which 
■ is the scalar product of two polar or two axial vectors. The first of these 
functions reverses its algebraic sign when subjected to a transformation 
from a right- to a left-hand co-ordinate system; the second does not. The 
latter is invariant to any co-ordinate transformation, as a true scalar 
should be. An energy function is a scalar of this type. The other kind of 
scalar function, which is also encountered in physical problems, is called 
a pseudoscalar, since it has all the properties of a scalar except that it is 
not invariant to certain types of co-ordinate transformations. 

^ The product of a scalar and a vector yields a vector of the same type. 
Multiplication with a pseudoscalar, however, changes an axial vector 
into a polar one, and vice versa. It should also be observed that the addi- 
tion or subtraction of vectors or scalars should not be carried out without 
regard to their type or origin. 

In order that the geometrical visualization of it may be facilitated, a 
field is commonly pictured as associated with a system of so-called flow 
lines. In hydrodynamics, for example, these are the paths which are 
traversed by the component particles of the fluid. Since the velocity 
vector is tangent to these trajectories at every point within the flow 
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region, certain physical characteristics of the vector field itself can be 
recognized from such a system of flow lines {Jlow map)^ 

The magnitude of the function representing the vector field at any 
point is given by the density of flow lines in the surface normal to the 
vector at that point. The number of lines chosen to represent unit density 
is, of course, arbitrary, but it is significant to observ^e that since the 
magnitude is a continuous function (except at certain points or surfaces 
where lines begin or end), all points within the field must be thought of 
as occupied by lines. 

A given bunch of these lines, moreover, cannot intertwine with each 
other, because their continuous distribution would then necessitate a 
crossing of lines at some points, and this is impossible since their direc- 
tions must ever>'whcre be unique. The lines defining the longitudinal 
surface of a given bunch form what is known as a tube. Unless such a 
tube contains regions from which lines emanate or upon which they 
terminate, the total number of lines enclosed by it must evidently 
remain constant throughout the flow region defined by the tube. 

From this more physical point of view, vector fields are distinguished 
according to either of two characteristically different (yet in a sense 
complementary) properties which they may separately or simultaneously 
possess. Thus if the flow map exhibits lines which close upon themselves 
(arc endless), the field is said to exhibit turbulent characteristics. If none 
of the flow lines close upon themselves, the field is said to be fionturbu- 
Icni.f In connection with the latter statement it must be recognized, of 
course, that a flow map for a finite region may exhibit no closed paths, 
yet the greater field may be turbulent, for some of the paths may close 
outside the finite mapped region. 

A vector field which is solely turbulent (alternatively called rotational 
or solenoidalX) is associated with a flow map containing closed paths 
only. Figure 2 shows an example of such a field. A nonturbulent field 
(also called an irrotational or a potential fielfi) is associated with a flow 
map in which all the lines begin at a source and end upon a sink (or nega- 
tive source). For this reason, the potential field is sometimes referred to 
as a source field, and the turbulent one as a source-free field. Figure 
3 illustrates an irrotational field. 

In the sense that a source is the cause or origin of any field, the turbu- 
lent field must, of course, also have its sources. These, however, are 

♦These matters are discussed in greater detail in Electric Circuits, pp. 23-71. 

fThe term lamellar is sometimes used to describe such a field. 

X\ solenoid Is a channel or tube. The term “ solenoidal ” does not appear to be particularly 
appropriate because tubes of flow lines can also be mapped in a potential field. The important 
characteristic of the rotational field is the fart that its tubes close upon themselves, each 
forming an endless c-onduit containing the same number of flow lines throughout the length 
of its circuit. 
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referred to as vortexes. They are the whirlpools or eddies in which the 
field has its seat or origin. A turbulent field is sometimes also called a 
vortex field. 

An arbitrary vector field can exhibit both turbulent and nonturbulent 
characteristics, caused by the simultaneous presence of sources and 
vortexes. The turbulent and nonturbulent components of this field are, 
however, linearly independent. In other words, an arbitrary vector field 



Fig. 2. A vector field which is solely turbulent. 

may always be represented as the linear superposition of two independent 
components, one of which is a purely turbulent and the other a purely 
potential* field. 

These matters, together with a number of useful vector operations and 
their interrelations as well as the geometrical interpretations of them, 
are discussed in the following articles. One of the various operations 
encoimtered here is the linear transformation discussed in Ch. III. The 
coefficients of this transformation (elements of its matrix) may be 
functions of the space co-ordinates, so that for each point in space, a 
particular vector transform is associated with any given vector. This 

•This rather common designation for a nonturbulent field is appropriate because a vector 
function representing the gradient of a scalar potential is inherently nonturbulent, as is shown 
in detail sulx^equently. 
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transformation function is called a tensor*^ of valence 2. The order of the 
tensor is that of its matrix. For ordinary space, therefore, the tensor is 
of the third order. The coefficients of the transformation are referred to 
as the components of the tensor. 

A tensor of higher valence is a function which, at every point in space, 
associates a tensor of the next lower valence with any given vector. A 



Fig. 3. A nonturbulent field. 


tensor of order « and valence ? has compohents. For example, a tensor 
of the third order (for ordinary space) and valence 3, has 3® or 27 com- 
ponents. Its matrix may be regarded as a three-dimensional array. In 
this classification, a vector is sometimes referred to as a tensor of valence 
l,t and a scalar as a tensor of valence 0. 


•The name tensor originated when this kind of function was first used in connection with 

problems dealing with stresses in elastic media. . . . - . » • 

tin this interpretation of a vector, its components are regarded as defining a row matrix, 
and the linear transformation (tensor) corresponding to this matrix is a single Un^r equation 
like one of the equations of the set 3, Ch. III. This transformation is seen to transform a given 
vector X into a scalar (single component), but the classification of a vector as a tensor of 
valence 1 nevertheless appears to be Inaccurate because it confuse the vector mth a trans- 
formation. It would be more proper to say that the components o. a vector (not the vector 
itself) may be regarded as those of a tensor of valence 1. 
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Since the analysis of a physical problem usually requires a co-ordinate 
system of some kind, the various vector operations discussed in this 
chapter are expressed not only in vector form but also in terms of an 
assumed system of co-ordinates. In most of the detailed formulations, 
the rectangular Cartesian system, being the simplest, is chosen. Unless 
mention is made to the contrary, a right-hand system of axes is assumed. 
The latter is so named because a right-hand screw, turning in the direc- 
tion of the shortest route from the positive x-axis to the positive y-axis, 
advances in the direction of the positive s-axis. Transformations of the 
important vector operations to some of the orthogonal curvilinear co- 
ordinate systems more frequently encountered in practical problems are 
given in a subsequent article. 

2. The scalar product 

The scalar product of two vectors A and B (also called the inner 
product) is defined as the product of their magnitudes multiplied by the 
cosine of the angle between them. The scalar product is denoted by a 
dot placed between the s>Tnbols A and B\ hence 

A-B=\A\ \B\ cos 6 [1] 

in which 6 is the angle included between the two vectors.* 

According to this definition, the scalar product may alternatively be 
regarded as the product of the length of either vector with the projection 
of the other upon it. If one of the vectors, for example, B, is given by the 
vector sum (according to the parallelogram law of addition) of two other 
vectors, as in the expression 

B = C + D [2] 

then 

AB = A {C + D) = AC + AD [3] 

This result is seen to be true because the projection of B upon A is evi- 
dently equal to the sum of the projections of C and D upon A . Hence the 
distributive law holds for scalar products. 

When it is necessary to express the scalar product in terms of the 
components of the vectors A and B with reference to a rectangular 
Cartesian co-ordinate system, it is convenient to define a set of unit 
vectors having the directions of the x-j y-, and 2 -axes. These unit vectors 

^According to this notation, which is attributed to Gibbs, the scalar product is sometimes 
also referred to as the dot product. An alternative notation quite frequently found in the 
literature is to indicate the scalar product by enclosing the symbols for the two vectors in 
parentheses, thus: i4 • 5 » {AB). 
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arc denoted respectively by the letters and k. The components of the 
vectors — that is, their projections upon the x-, y-, and z-axes — are de- 
noted respectively by Ay, A and By, J5^. 

The vectors themselves may then be written 

A = iA^ +jAy + kAz [4] 

and 

B = iBx + jBy + kBz 

The terms in these equations arc vector components, and the right-hand 
sides represent vector sums. The scalar product of A and B may now be 
written 


A B = {iAz +jAy + kAz) (iB, +jBy -f- kBz) [5] 


Since the distributive law holds, the right-hand side of Eq. 5 may 
be replaced by the sum of nine component scalar products such as 
{iAz) • (iBjc). {iAz) • ( jBy)y etc. The unit vectors i, j, k are mutually at 
right angles to each other. Hence the scalar product of any one of these 
with any other one is zero, and the scalar product of any onq with itself 
is unity; that is 


i*i=j'j = k'k = 1 


[ 6 ] 


Consequently, only three of the nine component scalar products resulting 
from Eq. 5 have nonzero values, so that 

A ' B = AxBx + AyBy “1- AgBz [7] 

The scalar product is thus given by the sum of the products of the corre- 
sponding compx)nents of the two vectors. This result is true only for a 
rectangular co-ordinate system (discussed in Art. 4, Ch. Ill) and no 
longer holds when the co-ordinate axes make oblique angles with each 
other. 

It is clear from the definition of the scalar product that the commutative 
law holds; that is, 

A B^ B A [8] 


Since the scalar product of two vectors yields a scalar, a triple scalar 
product such as might be denoted hy A - B • C has no meaning, and the 
question whether the associative law holds hence does not arise. 


3. The vector product 

The vector product of two vectors A and B (also called the outer 
product) is defined as a vector whose magnitude is given by the product 
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of the magnitudes of A and multiplied by the sine of the angle between 
them. The direction of the vector product is related to the directions of 
A and B by the right-hand screw rule in the sense that a right-hand screw, 
turning in the direction of the shortest route from the tip of A to the tip 
of B (assuming that these emanate from a common point*), advances in 
the direction of the vector product. 

In connection with this definition, it is significant to observe that the 
angle 6 betw’een A and By which enters into the determination of the 
magnitude of the vector product, is that angle through which the right- 
hand screw defining the direction of the resulting vector is turned in 
passing from the tip of A to the tip of B. This is usually taken to be the 
smaller of the two supplemental angles through which it is p)ossible to 
turn in passing from one to the other of any two coterminous vectors A 
and B. The result, however, is the same if the larger of these two angles 
is chosen, because the direction is then reversed and the algebraic sign of 
the magnitude is reversed also. 

In the Gibbs notation, the vector product is denoted by a cross, placed 
between the symbols for the two vectors, t Thus the magnitude of the 
vector product is expressed by 

\A ^ B\ = |i4| |J5| sin 6 [9] 

This expression is recognized geometrically as being numerically equal to 
the area of the parallelogram defined by the tw’o coterminous vectors A 
and B. 

If the cosines of the angles between the normal to the surface of this 
parallelogram (in the direction of the vector A ^ B) and the x-, y-, and 
2 -axes of a rectangular co-ordinate system are denoted respectively by 
cos («,x), cos («,y), and cos («, 2 ), then 

Vx = \A ^ B\ cos (»,x) 

Vy = |i4 cos («,y) [10] 

Vt = \A>^B\ cos (n,z) 

represent the projections of the area of the parallelogram upon the yz, 
zXy and xy planes respectively. Since 

cos^ {tiyx) + cos^ (»,y) -f cos' (», 2 ) = 1 [11] 

^In other words, the vectors are for the moment assumed to be coterminous, as they 
usually are. The definition of the vector product (and the scalar product also) is, however, 
independent of whether the vectors are coterminous. If they are not, then the direction of 
the vector product may perhaps be more easily visualized by first displacing one of the vectors 
parallel to itself untfl the two become coterminous. 

fFor this reason, the vector product b sometimes also referred to as the cross- prodwL 
An alternative method for indicating the vector product b to enclose the symbols for the two 
given vectors in square brackets, thus: A^B ^ lAB], 



Art. 


THE VECTOR PRODUCT 


191 


it follows that 

M X -B| = \/VT^rvJTv7 [12] 

and hence K*, Vy, and are seen to be the components of the vector 
product 

V = A^B [13] 

In other words, the component areas given by Eqs. 10 are recognized 
as having the properties of vector components, and their vector resultant 
is identified with the vector product as defined above. 

Observ'e in this connection that a component area (projection of the 
surface of the parallelogram determined by the vectors A and B upon one 
of the three co-ordinate planes), as defined by one of the three Eqs. 10, 
is not merely a geometrical projection, but in addition involves an alge- 
braic sign which reverses if the direction of the normal is reversed (re- 
placing the angles of the cosine functions by their supplements). That is, 
the algebraic signs of the components 10 are controlled by the right-hand 
screw rule for the vector product. 

If the projections of the vectors A and B upon the yZj zXj and xy planes 
are denoted by A^^^\ B^^^\ and so forth, it follows from these considera- 
tions that the vector comp)onents of V are given by 

iV^ = A^^^^ X 

= [14] 


The magnitude of one of these components, such as Fx, for example, is 
given by 

7* = [15] 

in which Byt is the angle included between and Replacing this 
angle by the difference between the angles which these vectors separately 
make with the y-axis, applying the trigonometric identity for the sine of 
the difference between two angles, and noting that the components of 
and are those of A and B on the Y and Z axes, it is found that 

7* = AyB, - A,By [16] 


and similarly that 

Vy = A,B, - A,B, [17] 

and 

7* = A:,By - AyB, [18] 

These are the components of the vector product expressed in terms of 
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those of the two given vectors A and B, If the vector B is the resultant 
of two other vectors, that is, if 

B I) [19J 

the decomposition of C and D into their rectangular components, and 
substitution into Eqs. 16, 17, and 18 show that 

V == A - B == A > {C + D) A ^ A - D [20] 

Hence it follows that the distributive law holds with regard to the vector 
product. 

Conversely, if the distributive law is assumed to hold, the vector 
product 

A-B ^ (iA^ +jAy-h kA,) X (/Bx + jBj, + kB,) [21] 

may be replaced by the sum of nine component vector products. Ac- 
cording to the definition of the vector product, 

=j.j = k»k = 0 [22] 

and 

ixj = k = 

j,k = i=-k^j [23] 

k^i — j = —i*k 

so that three of the nine terms represented by Eq. 21 become zero, and 
the remaining six yield 

A X B = i(AyB2—A2By)-{-j{AtBx—AjrBg)-^k{AxBy—AyBr) [24] 

This result is seen to agree with that stated by Eqs. 16, 17, and 18. 

It is useful to recognize that the vector product may be written in the 
following determinant form : 

^ i j fe ' 

A-B^ A^ Ay A^ [25] 

Bx By Bg 

The Laplace expansion of this determinant in terms of the elements of 
its first row yields the vector product in the form given by Eq. 24. 

Although the distributive law holds for the vector product, the com- 
mutative law evidently does not, since 

A * B [26] 

The triple vector product 

AxBxC [27] 

has a unique meaning only when the order in which the products are to 



Art. 4\ 


THE SCALAR TRIPLE PRODUCT 


193 


be carried out is indicated. Thus in the association indicated by 

[28] 

the product B^C is formed first, and then the vector product of A with 
this vector is determined. The result is normal to B and hence is a 
vector which lies in the plane determined by the vectors B and C. In the 
association indicated by 

[A-B]^C [29] 

the product [A ^ B] is formed first, and then the product of this vector 
with the vector C is determined. The result in this case must be normal 
to A ^ B, and hence is a vector which lies in the plane determined by 
the vectors A and B. 

Evidently 

x[5xC] [A-B]-C [30] 

so that the associative law does not hold for multiple vector products. 

Since the resultant vector for the triple product 28 lies in the plane of 
the vectors B and C\ it must be |X)ssible to e.xpress this vector as a linear 
combination of B and C’, that is, 

A-[B-C]-- pB + yC [31] 

An evaluation according to the form given by Eq. 24 for the vector 
product shows that 

A-[B-C] {A-C)B - {A- B)C [32] 

which agrees with Eq. 31, in which 

= {A 'C) and y = —{A-B) [33] 

‘ The triple product 29, on the other hand, is 

[^ xB]xC = -Cx[^xB] C-[B^A] [34] 

This has the form of the triple product in Eq. 32 with A and C inter- 
changed. Hence 

[^x5]xC= {A-C)B- {B-C)A [35] 

which is a vector lying in the plane determined by the vectors A and B, 
as stated above. 

4. The scalar triple product 

The following combination of a vector and a scalar product 

A[B-C] [36] 

in which A , J5, and C are arbitrary vectors, is called a scalar triple product. 
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The definition of the scalar product in the form given by Eq. 7, together 
with the determinant form 25 for the vector product, shows that the 
scalar triple product may be expressed as the value of the determinant 


A-[B^C] 


Ar A,, A^ 
Bx By Bg 
C Cy Cg 


[37] 


The result is, of course, a scalar. 

Since the transpose of a determinant has the same value,* the rows in 
the determinant 37 may be written alternatively as columns. The value 
reverses its algebraic sign when any pair of rows are interchanged, but it 
remains unchanged if this interchanging is done twice in succession. Since 
the cyclic order of the letters Ay By C can be changed to By Cy A by two 
interchanges, and to Cy Ay B by two more interchanges, it follows that 

A [B-C] B^[C-A] =C.[^^5] [38] 



hi B projection vector Aupon BxC 
IBxCiBarea of parallelogram OBPC 
Fig. 4. A geometrical interpretation of a scalar triple product. 

A geometrical interpretation for the scalar triple product is readily 
given as shown in Fig. 4. The magnitude oi[B*C] equals the area of the 
parallelogram determined by jB and C;its direction is normal to the plane 
of this parallelogram. If the scalar product is interpreted as the length 
of the vector [B^C] multiplied by the projection of A upon it, the scalar 
triple product is seen to be equal to the product of the area of the 
parallelogram determined by B and C, multiplied by the component of A 
normal to this surface. The result evidently represents the volume of the 
parallelepiped of which three coterminous edges coincide in length and 

*See Art. 2, Ch. I. Specifically, this is there stated as the property VIII. 
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direction with the three vectors A, B, and C. The parallelogram deter- 
mined by B and C is regarded as the base of this parallelepiped, and the 
normal component of A is its altitude. In the equivalent forms given by 
Eq. 38, the base of the parallelepiped is alternatively regarded as defined 
by the vectors C and A^or A and B. These alternate interpretations are 
illustrated in Figs. S and 6. Each of the three expressions in Eq. 38 
represents the volume of the same parallelepiped. 



A] ■ projection of vector B upon Cx A 
ICx A U area of parallelogram OCQA 


Fig. S. Alternate interpretation of 
a scalar triple product. 


triple product A Zrf 

C-(AxB) /! /! 



A 3 a: projection of vector C upon A xB 

i A X BUarea of parallelogram OBSA 

Fig. 6. Alternate interpretation 
of a scalar triple product. 


In connection with this scalar triple product, observe that the brackets 
enclosing the vector product of B and C in the expression 36 are signifi- 
cant because they indicate the order in which the operations are to be 
carried out. Thus, if this triple product were written A • B^C, it might 
be thought that A • B could be carried out firj^i if desired. This procedure, 
however, yields a scalar, and the subsequent’ vector product with C then 
has no meaning. 

5. The gradient 

A single-valued scalar function of the space co-ordinates x, y, z is 
denoted by the symbol U. It is a function of position or location only. 
The points in space at which U has a given value, for example, C, define a 
surface which is referred to as a constant-value surface. Any number of 
such surfaces, for various assumed values of the constant C, may be 
mapped. In particular, it is expedient to map a series of constant-value 
surfaces for values of C which differ by integer multiples of some chosen 
interval. A familiar example of such a map is that for the two-dimensional 
altitude function in a geographical terrain. Here the function U is constant 
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along lines instead of surfaces. These are called contour lines, and the 
resulting plot is spoken of as a contour map. 

Such a map places the variation of the function U in evidence, since 
the function e\ddently changes slowly in those regions where the lines or 
surfaces are far apart, and rapidly where they are closely spaced. The 
rate at which U varies in any given direction at a point in space is 
determined approximately by the ratio which the inter\^al chosen for the 
constant C-values has to the distance measured between two neighlx>ring 
surfaces in the given direction at the point in question. If the interv'al in 
the C-values at this point is allowed to become smaller and smaller, the 
corresponding limiting value of the ratio accurately yields the desired 
rate of change of U. This value is called the directional derivative of U, 

It is apparent that the directional derivative of C is a maximum at^a 
given point if the derivativ e is taken in a direction normal to the constant- 
v'alue surface passing through that point, because the distance between 
neighboring surfaces is evidently smallest in the normal direction. This 
maximum value of the directional derivative is called the normal deriva- 
tive of U, 

As a function of the space co-ordinates, the normal derivative of U 
appears to have the prop)erties of a vector function. The truth of this 
statement may be seen from the fact that if dn is the differential distance 
in the direction of the normal between two neighboring constant-v'alue 
surfaces for which C differs by dC, and if ds is the distance between these 
surfaces in any other direction s, then, except for differentials of higher 
order, ^ 

dn ^ ds cos $ [39] 


where 0 is the angle between the normal and the direction s. It follows that 


dU dU dn dU 

“T" = T == “T” ^ 

ds dn ds dn 


[ 40 ] 


in which dUldn is the normal derivative of U. 

This result shows that if the normal derivative is regarded as a vector 
pointing in the direction of the normal, the derivative of U in any direction 
s is given by the projection of this vector upon a line having that direction. 
The normal derivative, therefore, has true vector character. This vector 
is called the gradient of U at the point at which the normal derivative is 
evaluated. 

The gradient is defined as pointing in that direction in which V in- 
creases and it is evidently a function of the space coordinates, »nce its 
magnitude and direction depend upon the point at which dVfdn is 
evsduated. For a geographical altitude function, the gradient at any 
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point indicates the direction of steepest ascent, and its magnitude equals 
the maximum rate of change of altitude with distance at that point. 

In equations, the gradient is written in the abbreviated form : grad U. 
If n denotes distance measured along the normal at any point in the 
direction in which U increases, and Ui represents a unit vector in this 
direction, the gradient is expressed by the vector equation 

grr 

grad U == ni— [41] 


If 5i denotes a unit vector in any direction s, the component of the 
gradient in that direction is given, with the help of the scalar product 
and Eq. 40, by 


grad, f/ = 5i • grad U = Si 


dU dU ^ dU -- 

— = —cosd = — [42] 

dn dn ds ^ ^ 


In a rectangular co-ordinate system, the components of the gradient are 


A IT A IT 

grad, U = — grad, = - 


ATT 

grad, ^ [43] 


and hence 


grad U = t— + J — + k — 
dx dy dz 


[44] 


A more compact form for this expression is obtained by defining the 
so-called Hamiltonian operator* 


dx ^ dy dz 

The equivalent of Eq. 44 then reads 

grad U = VU 


[45] 

[46] 


The operator V may in some respects be formally treated as a vector 
with the components d/dx, d/dy, djdz. These, however, cannot in general 
be manipulated as though they were ordinary algebraic coefficients. 
They are differential parameters (or operators) of the first order, and 
hence perform the operation of differentiation upon whatever function 
follows them. 

For example, if W is another scalar function, the rule for differentiating 

♦Since the symbol for this operator is an inverted Greek capital delta, it is frequently 
referred to by the name “ del,” and grad U is alternatively called “ del of UT' Another name 
for the operator V is ** nabla,” after the Greek name for a harp, which this symbol re- 
sembles in form. 
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a product shows that 

V{UW) - UVW + WVU [47] 

or 

grad (Z7ir ) = grad IF + grad U [48] 

Another example is the scalar triple product v^6 in which the operator V 
takes the place of the vector A. Here the relation 38 is not applicable 
without due attention to the implied operation of differentiation in the 
multiplication of V with the functions B and C. The correct evaluation of 
the expression V • [jB C] is given in a subsequent article (see Art. 16) 
after the operations V • A and ^ ^ A have been discussed. The present 
remarks are made merely to caution the reader against any careless ma- 
nipulation of the vector operator V. 

An important property of the gradient comes to light from a con- 
sideration of the so-called line integral of 
this vector function evaluated for an arbi- 
trary path extending between any tw'o 
points in space. Thus if 

/ = grad U [49] 

then the integral 

jf/ds [50] 

is referred to as the line integral of the 

vector function / between the points 

and b{x 2 yy 2 yZ 2 )> These two 

points are assumed to be connected by acon- 
tinuous path or curve S of arbitrary form. 
Distance along this curve is denoted by the 
symbol s. The differential vector distance 
ds is at any point tangential to the path in the direction of continuous 
progress along it from a to by and in magnitude equals the scalar increment 
of length ds. The scalar product / • ds then equals the component of / 
coincident with the direction of travel along the path at any intermediate 
point, multiplied by the corresponding path increment. This relationship 
is illustrated in Fig. 7. 

If / represents the force on some particle which is constrained to follow 
this path (like a bead on a bent wire), in the absence of friction the line 
integral SO evidently yields the total work done by the force as the particle 
travels along the path from a to b. 



Fig. 7. A scalar product in- 
volving the differential vector 

ds. 
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According to Eqs. 40 and 41, it is recognized that 

f ■ 6.S til - = = dU [51] 

dn ds ■' 

in which dU is the differential increment of work done by the force / in 
mo\dng the particle through the path increment ds * Since £/ is a function 
of position only, dU is a total differential. Hence 

C /■ ds = Tgrad f/ • ds = dU = U(x 2 ,y 2 M) - U{xuyiM) [52] 

wa Ua t/a 

From this result it is concluded that the line integral of the gradient 
bet^iceen any two points is independent of the path joining these points. 
Hence if the particle is subsequently returned to the point a along any 
other path from h to a, the total work done by the force / as the particle 
traverses the closed circuit from a to 6 and back to a, is zero. 

SNTnbolically, the line integral extending over a closed circuit is 
indicated by a circle placed upon the integral sign. Thus the vector func- 
tion defined as the gradient has the property that 

^ grad • ds = 0 [53] 

It is important to observ’e that the truth of the results expressed by 
Eqs. 52 and 53 depends upon the single valuedness of the function U, 
This restriction on the function U, which is stated in the opening para- 
graph of the present article, is not always met by functions dealt with in 
practical problems. Further considerations, necessary when U is multi- 
valued, are given in Art. 14 of this chapter. 

In the above argument, the force / is regarded as due to some external 
' agency which is causing the motion of the particle along a given path. 
This external or driving force must, of cour e, be balanced by an equal 
but oppositely directed force of reaction. The latter is due to the inherent 
properties of the medium or system through which the particle is moved. 
If F denotes this force of reaction, evidently ^ 

F = —grad U [54] 


In this connection, U is spoken of as the potential function of the sys- 
tem, and F is the vector field of force associated with U,liUQ denotes 
the value of U at some chosen datum point, then U - Uq represents the 




200 


VECTOR ANALYSIS 


ICh. V 


work which must be done upon the particle to move it from the datum to 
the point to which U refers. This work is called the pokntial energy of the 
system consisting of the given particle and the medium in which it is 
embedded. 

A simple example is the potential energy of a particle (usually con- 
sidered to have unit mass) located at some altitude above sea level. 
The function F is then referred to as the earth's gravitational field 
of force. Here the significance of the relation 53 is readily visualized. 
Thus, whatever work may be done by the external force / while the 
particle traverses part of the closed path is returned to the external agency 
during the traversal of the remainder of the circuit. This would be the 
case, for example, if one were to carr>" some object around a closed path 
on the side of a mountain. 

The work which the external agency may contribute during the 
traversal of a certain portion of the path is thought of as stored by the 
force field through which the particle is moved, and the significance of 
Eq. 53 is that such stored energy is not lost but may be completely 
regained. A force field which has this property of conserving whatever 
energy" increment may be imparted to it is called a conservative field. This 
is the property of a vector field defined by the gradient function. 

Such a vector field is evidently irrotational in character, for if the flow 
map for this field were assumed to contain any lines which close upon 
themselves, one of these could be chosen as the path for the integral in 
Eq. 53, and the value of this integral would then certainly not be zero. 
Hence the gradient always defines a purely potential field, all the lines of 
which must emanate from sources and terminate upon sinks. A conserva- 
tive field is always irrotational. 

Conversely, if for a given vector field F it is known that 

^Fds = 0 [55] 

it must be possible to define a potential function U such that F is given by 

Eq.44. 

6. The divergence 

The sources of a potential field are sometimes thought of as con- 
centrated at points or distributed along filaments or over surfaces. 
Whereas such source distributions may, in the discussion of certain 
physical problems, be convenient from an analytic point of view, they 
nevertheless are idealisations which require a proper mathematical 
interpretation. For the present discussion, the sources of a potential 
field are considered to be continuously distributed throughout space. 
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According to the hydrodynamic analogy, a region in which sources are 
located is one from which fluid emanates in a continuously distributed 
fashion, like oil seeping up through the pores of a bed of quicksand. (The 
pores as well as the rates of flow through them are to be thought of as 
being infinitesimal.) The object of the present discussion is to formulate 
some means for describing the fluid productivity of an infinitesimal 
element of space in a source-filled region. In other words, some measure 
of source density or intensity is needed in order that a given distribution 
of sources may be described and the relation of this distribution to the 
associated field intensity may be determined. 

The net rate at which fluid emanates from a small but finite productive 
region may be measured through integrating the rate of flow over a 
surface enclosing this region. If this net rate is divided by the enclosed 
volume, the resulting figure represents an average rate of productivity 
per unit volume for this region. The actual rate of productivity per unit 
volume may, of course, vary from pioint to point throughout the region. 
At any given p>oint, it is expressible as the limit of the average rate 
obtained through shrinking the enclosure about that point until the 
contained volume becomes infinitesimal. 

This limiting value is a convenient measure of the intensity of the 
source region at any point, and it is referred to as the diver geme of the 
flow field at that point. Although the flow field is a vector function, its 
divergence is clearly a scalar. 

If A is any vector function, then in accordance with the discussion 
just given, its divergence (abbreviated div A) may be mathematically 
defined by 


divi4 = limit 
J'd9—*0 


^ A - da 

closed i »\irface 

jdv 

enclosed volume 


[56] 


The circle on the integral sign in the numerator indicates that the inte- 
gration extends over a closed surface. In this integral, da is a vector surface 
increment. Its magnitude equal's the scalar differential area da at any 
point on the surface, and its direction is that of the outwardly directed 
normal at that point. The scalar product A • da, therefore, represents the 
product of the normal component of A and the scalar differential area da 
at the same point. This equals the rate at which fluid passes through the 
surface element da, if A is thought of as representing the velocity field 
of a fluid. 

The integral in the numerator of Eq. 56 is then seen to equal the total 
rate at which fluid passes outward through the dosed surface. The integral 
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in the denominator of this expression represents the enclosed volume. 
The resulting limit of the ratio of these two integrals yields the divergence 
of A at the point about which the closed surface is shrunk by the limiting 
process. Its value is in general different for different points in space. 

In order for div A to be calculated when the vector function A is 
given, Eq. 56 must be further evaluated. For this purpose, the enclosed 



Fig. 8. Enclosed volume assumed in the calculation of div A, 


volume is effectively assumed in the form of a rectangular parallelepiped 
with its center located at the origin of a rectangular co-ordinate system, 
and three of its coterminous edges coincident in direction with the co- 
ordinate axes as shown in Fig. 8. If the sides of this parallelepiped are 
identified with the differentials dXy dy, dz, the desired result is at once 
obtained without the necessity of subsequently carrying out the limiting 
process. 

The vector function A is assumed to be finite and continuous in the 
vicinity of the parallelepiped, so that, except for differentials of the 
second order, the variation of A is linear throughout this infinitesimal 
region. If Az^ Ay, A z denote the values of the components of A at the 
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origin, for the surface of the parallelepiped 

da - (4. + ^ I) - (4. - ^ 

+("*•+ ^ 1)*“*’’ ■ (-*' ■ Hi f)*'*’' 

which yields 

y^.d..(^' + ^« + ^)*,d,* [58] 

The enclosed volume is given by 

Jdv — dv = dxdydz [59] 

Hence, in rectangular co-ordinates, Eq. 56 evaluates to 

[“1 

By means of the Hamiltonian operator defined by Eq. 45, and the 
form for the scalar product given by Eq. 7, the result stated in Eq. 60 
may be written 

div A — V ’A [61] 

It is significant that the partial derivatives in Eq. 60 are alwa)^ to be 
evaluated at the same point in space at which the divergence of A is 
desired. If the result is numerically positive, the point in question is a 
source; if it is negative, the point is a sink. If the result is zero over a 
finite region, the latter is source free. On the other hand, if div is zero 
throughout dl space, it may be concluded either that A is zero ever 5 nvhere 
or that this vector function describes a field which is purely rotational in 
character.* The possibility that A might be constant throughout all 
space must be discarded on the physical ground that any field must 
vanish at infinity (exceptions to this rule are due to idealizations which 
are physically unrealizable). 

7 . Gauss’s law 

According to the definition of the divergence as expressed by Eq. 56, 
*T1iis alternative is considered in greater detail in Art. 10. 
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it follows that 

• da = J div A dv [62] 

cl(K'ipd 8urfaf'e enclofted volume 

Since the integrand in the right-hand integral, namely, div A dv, repre- 
sents the fluid producti\dty for the volume element dv, whence this 
integral yields the total productivity for the finite region, a result that is 
alternately given through integrating the fluid flow over the enclosing 
surface. 

This result, which is known as Gauss's la^c, formally represents the 
transformation of a surface integral into a volume integral, or vice versa. 

It should be observed that the hydrodynamic analogy, used in the 
preceding article to lend concreteness to the definition of the divergence, 
tacitly implies that the “ fluid ” be iucofnprcssible. An arbitrarv' vector 
function may be likened to the flow of an incompressible or ideal fluid. 
In this light. Gauss’s law becomes almost self-evident, since it states 
merely that all the incompressible fluid produced within a given region 
(this is the volume integral of div A) must issue from the enclosing 
surface. 

Gauss’s law is generally applicable to any vector function for which the 
divergence exists, that is, for which the partial derivatives given in Eq. 60 
can be formed. If this condition is not, the vector function A is in general 
regular and continuous throughout the region involved. When these 
conditions are not fulfilled, the desired transformation may in certain 
cases still be achieved by means of special manipulations to which the 
following discussion is pertinent. 

8. Idealized soltrce distributions 

If the region contains points at which A is infinite or discontinuous, 
the divergence at such points cannot be evaluated by means of the formula 
given in Eq. 60. Such situations occur in practical problems under 
idealized assumptions. For example, it is sometimes convenient to 
assume that a source region is two-dimensional or one-dimensional, or 
even that it has zero dimensions. These idealized source distributions 
are referred to respectively as a surface distribution, a filamenial dis- 
tribution, or as a point source. 

For the continuous distribution of sources considered in the previous 
article, the source density or productivity per unit volume may con- 
veniently be denoted by some symbol such as p, and defined in con- 
formance with its analogy to electric charge density by the equation 

div = p [63] 
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Equation 62 may then be written 

^ A • da. = J p dv [64] 

closer! surface enclosed volume 

Here the volume integral on the right represents the total productivity 
of the region enclosed by the surface over which the integral on the left 
extends. Equation 64 evidently holds regardless of how the source 
density p is distributed throughout the enclosed region, and hence it is 
{X)ssible to assume (if convenient) that the total producti">ity is con- 
centrated at one or more points. These are then referred to as point 
sources. 

In certain physical problems, the actual distribution of source density 
approximates this idealization closely enough to justify such an assump- 
tion, and thus a simpliheation in the resulting mathematical relationships 
for the determination of the field is made available. 

A filamental source density distribution is similarly an idealization 
found convenient in certain types of physical problems. It is significant, 
however, that for the point or filamental types of source distributions 
the concept of the divergence does not apply. External to the points or 
filaments, the divergence is, of course, zero, whereas for points coinciding 
with these idealized sources the divergence as defined above becomes 
infinite. 

For a point source, the total productivity (point charge) may be 
defined by the relation 

e = pdv [65] 

in which the density p is, of course, infinite because e is assumed finite. 
Similarly, for the filamental distribution, a productivity per unit length 
may be defined as 

q = pda [66] 

the cross-section of the filament being denoted by da. Here again, p must 
be considered infinite. 

Actually there can be no infinite density p, but the practical examples 
to which the idealizations expressed by Eqs. 65 and 66 apply are such 
that the geometrical relations are closely approximated when the actual 
finite source region or filament cross-section is replaced by the infini- 
tesimals dv and da respectively. 

In an analogous manner, a surface distribution of sources is defined 
as having a productivity per unit area given by 

a = pds [67] 

in which the differential thickness ds represents the actual small finite 
thickness of a continuous distribution in the form of a layer. 
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For this surface distribution, it is useful to extend the definition of 
the divergence by making use of the relations 63 and 64. A differential 
surface element da of the source layer is assumed to have a thickness ds 
which is vanishingly small in comparison with the surface dimensions of 
da. This is a perfectly admissible assumption which in fact adds to the 
preciseness of the definition 67. The integral on the left of Eq. 64, applied 
to the surface enclosing this element, reduces to the flow outward from 
the two opposite faces of the layer element, so that this equation yields 

<T da — (-4„1 -«4n2) 

in which A^i and are the outwardly directed normal components of 
A on the two sides of the layer. By analogy to Eq. 63 it is then possible to 
define a so-called surface divergence per unit area, given by 

di\V^ = <r = Anl + An2 [69] 


9. The scalar potential function associated with a given 

SOURCE DISTRIBUTION 


If i4 is a vector function describing a potential field, it is related to its 
sources by Eq. 63, and to a scalar potential function U by an equation 
similar to Eq. 54. Hence 

div A = — div grad U == p [70] 

and it follows that a potential function U may be associated with any 
given source distribution p. 

By means of Eqs. 46 and 61, this relationship may be written 

V • Vt7 = ~p [71] 

and, with the help of Eqs. 44 and 60, the interpretation of this form is 
readily seen to be expressed by 

dW ^ d^U 

dx^ a/ dz^ ~ '' 



Alternatively, the scalar product V • V may be interpreted as the resultant 
operator 


V- V = 


il 

dx^ ^ 


[73] 


which yields Eq. 72 when applied to the potential function U. 

The differential operator of the second order defined by Eq. 73 is 
referred to as the Laplacian operator, and the form 


V- VU 


d^U dW d^U 

dx^ df dz^ 


[74] 
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is spoken of as the “ Laplacian of U For purposes of abbreviation, 
V • V is alternatively denoted by the symbols and A, so that Eqs. 71 
or 72 are frequently written either as 

V2£/ = -p [75] 


or as 


^V ^ ~p 


[76] 


the latter notation being the simpler but subject to some objection on the 
ground that the symbol A might, according to its more common inter- 
pretation, be confused with the notation for an increment. 

The significant point of the present discussion is the fact that a given 
source distribution defines an associated scalar potential function by 
means of Eq. 72, and this in turn determines the associated vector func- 
tion A by means of the relation 


id = — grad V [77] 

This view of the relation between the functions A and p may seem to be 
more roundabout than the apparently simple one expressed by Eq. 63, 
but in the solution of many practical problems it proves to be a more 
convenient method of determining A from p, because Eq. 72 involves 
scalar functions only and hence is frequently more easily integrated than 
Eq. 63. Once the function U is obtained, it is usually not difficult to 
determine A from Eq. 77 because doing so involves differentiation only. 

Moreover, there are certain problems in which the source distribution 
p is not known, but, instead, U and its normal derivatives are known 
over boundaries of given geometrical form to which the unknowm source 
distribution is restricted. For the solution of these so-called boundary- 
value problems, the attack by means of Eq. 72 is the only possible one. 
At all points not located on the boundary, p - 0, and this equation reads 


dW dW d^U 

^ ^ = TT + + VT 

dx^ dy- dz^ 


= 0 


[78] 


A function which formally satisfies this differential equation, and in 
addition meets the stated boundary conditions imposed upon U, con- 
stitutes the desired solution. 

Equation 72 is known as Poissons equation, and the corresponding 
homogeneous equation 78 as Laplace's. The latter equation is of im- 
portance in connection with any problem in which the mapping of flow 
fields is essential, whether this is accomplished by analytic or by graphical 
means.* 


•.\n application of this method to the determination of electric circuit parameters is given 
in FJectric Circuits, Ch. I, .\rt. 6a, pp. 16 ff. 
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10 . The curl of a turbulent* vector field 

In the present article, attention is turned from the potential field to 
what might be called its complement — the turbulent or rotational field. 
Here the flow lines form closed circuits, so that the line integral of the 
vector function .1 taken around a closed path is in general not zero. 
That is, for a rotational vector function A 

y^l-dsF^O [79] 

The seat or origin of a turbulent field lies in its vortexes or whirlpools. 
It is clear that the greater the intensity of the whirlpools in a given 
region, the greater is the value of the integral 79 evaluated for various 
contours within this region. It is, therefore, reasonable to suggest that 
the value of this integral be used as a basis for defining the vortex density 
at any point. 

If the vector A is thought of as representing the force acting upon a 
particle, the integral 79 represents the work done by this force field as 
the particle is allowed to traverse a closed path. The path may, for the 
moment, be thought of as circular, with a radius r. The value of the 
integral 79 divided by 2ir then represents an average torque with respect 
to a concentric axis normal to the plane of this circular path. This average 
torque evidently varies with the angular orientation of the axis for a 
fixed location of the center of the circular path. There will evidently be 
one orientation for which the average torque is a maximum. As the radius 
is allowed to shrink until it becomes zero, the maximum average torque 
becomes zero also, but the ratio of this torque to the area of the circle is 
found to approach a finite limit when the force A has finite values. It is 
this limiting value of the ratio of the maximum average torque to the 
enclosed area which thus proves to be a useful measure of the turbu- 
lence of the field A at any given point. This value, except for tlie factor 
2ir, is called the curl of A at that point. 

It is not essential for the definition of the curl that the closed path be 
circular. It is essential, however, that this path lie in a plane so that the 
orientation of the normal be clearly defined, for the curl is a vector 
having this same orientation. Thus the curl of A is given in magnitude 

♦In some more recent considerations of hydrodynamic fields, the term turbulent is uscri to 
designate an entirely random character that specifically does not permit a mathematical 
representation in terms of either the gradient or the curl. Consideration of fields having such 
random character is not included in the present discussion. The term turbulent in this volume 
is used merely as a means of distinguishing the rotational from the irrotational character 
of a field. 
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by the maximum value of 

‘ ds 

limit [80] 

_ enclo.wl surface _ 

as the plane of the contour assumes all possible orientations. In direction, 
the curl of A coincides with the normal to the plane of the contour^ 


Z 



Fig. 9. An infinitesimal tetrahedron to illustrate the vector character of the curl. 

pointing in agreement with the advance of a right-hand screw which 
turns so as to correspond to the traversal of the contour in the evaluation 
of the closed line integral. The vector, curl Ay is thus uniquely defined in 
direction as well as in magnitude. 

Just as the divergence of a potential field is at any point a measure of 
the density of its sources at that point, so the curl of a turbulent field 
establishes an analogous relationship between the intensity of that field 
and its vortex density at any point. 

The vector character of the curl may be demonstrated by means of the 
geometrical configuration in Fig. 9, showing an infinitesimal triangular 
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contour ABC whose normal n has an arbitrary orientation relative to a 
reference system of co-ordinates. The sides of this triangle together with 
portions of the co-ordinate axes form the sLx edges of a tetrahedron with 
vertexes Oj A j B, C. The sides OBC, OCAj OAB, normal respectively to 
the A"-, F-, and Z-axes, have infinitesimal areas which are denoted by 
da^y dcyj and da The area of the side ABC opposite the vertex O is 
denoted by da. Since daj^, day, da., are the projections of da upon the 
co-ordinate planes YZ, ZX, and A'T respectively, one may write 

1 _ cos (n,x) _ cos (w,y) _ cos in,z) P , 

da daj: day da^ 


in which cos (u,x), cos (w,y), cos («,::) are the direction cosines of the 
normal n. 

It is likewise clear from the geometry of the configuration that 

£ A ' ds = £ A • ds -\- £ A ■ ds -\- £ ^4 • ds [82] 

JaBC done doc A doAIi 


inasmuch as the net integration represented by the right-hand side of this 
equation involves traversals in both directions along the edges of the 
tetrahedron emanating from the vertex O (see the circulatory arrows in 
Fig. 9). In terms of the notation 


and 


R. = 


f 

dOBC 


A • ds 


da^ 


R = 


i 


A • ds 


ABC 


da 


R.. 


f 

doCA 


A • ds 


da,j 


Rz - 


/ 

do AH 


A • ds 


da^ 


[83] 


[84] 


the results expressed by Eqs. 81 and 82 show that 

R = Rx cos {n,x) + Ry cos {n,y) + R^ cos {n,z) [85] 

Subject to the condition 

cos^ (n,x) + cos^ (n,y) -h cos^ {n,z) = 1 [86] 

the expression on the right-hand side of Eq. 85 attains the maximum 
value 


Rma. = VRjnniJTT? 


[87] 
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for the particular orientation of the normal given by* 


cos (w,x) 


R. 


cos (n,y) = 



cos (n, 2 ) = 



[ 88 ] 


Since the relations expressed by the last two equations are the familiar 
ones existing between a vector and its rectangular components, the 
desired proof is completed. 

It is clear that the curl is an axial vector similar to that representing a 
mechanical torque, the right-hand screw rule serving in both instances to 
relate the rotational motion to the axial direction of the vector. The com- 
ponents of this vector with resp)ect to a rectangular co-ordinate system 
may be determined by noting that the component of the curl in any 
direction s is given by 


curl, A — limit 

J'da—m 


^ A • ds 


plane contour normal to « 



enclosed surface 


[ 89 ] 


in which the direction of 5 is linked with the direction of traversal in the 
line integral l)y means of the right-hand screw rule. 

The .v-component of the curl is thus found from consideration of a 
closed contour lying in the yi:-plane. For convenience, this is taken to be 
the contour of a dilTercntial rectangle with its center at the origin, as 


•To ina.ximizc /(ar,>',s) = ajc -f by -f cz subject to the condition at -1- + -" = 1, form: 

/ — ax -f ^ -h y” 4- 2“ — 1). Equating the partial derivatives to zero gives: 


^1 

dx 


= a — Xx = 0 


dy 

K 

ds 


ft — Xy = 0 
c — Xs = 0 


X 


a 


y 


ft 

X 


c 

X 


The condition then yields X = \^a^ 4 ft* + so that one has 

/max = Va- 4 4 

and the particular values of the variables yielding this ma.ximum are 

a b c 


fmax 


For other applications of the method of determining conditioned maxima see Arts. 4 and 5, 
Ch. IV. 



212 


VECTOR ANALYSIS 


[Ch. V 


illustrated in P'ig. 10 . Since the co-ordinate itxes form a right-hand system, 
the A'-axis points upward from the plane of the paper, and the direction 
of traversal of the rectangular contour is as indicated by the arrows. It is 
clear that only the y- and ::-components of A contribute to the Jt*-com- 
ponent of the curl. 

The components of A in the direction of traversal along the sides 1 - 2 , 
2 - 3 , 3 - 4 , and 4-1 may be denoted respectively by /I12, ^2:^ ^^34, and 
/ 1 41. If /I j; and A: denote the values of the y- and ^-components of A at 
the origin, then, except for differential contributions of higher order 
(assuming A to be finite and continuous throughout the region covered 
by the rectangle), 


^12 = + 

-^23 = 


dAg ^ 

T 

dA y dz 
~ 2 


A 


34 


— ^Ag + 


dy 

y 


[90] 


and hence 


A4I — Ay 


dAy dz 

Hz y 


/ 


A • ds — A 


12 


dz + A 23 dy + A^4 dz -f- A41 dy 


(Ml 

\dy 



dz 


[91] 


According to Eq. 89, the a;-component of curl A , therefore, becomes 


3 

2 





OZ 

L_1 


0 


4j 



1 


r* — dy — H 



Fig. 10. A differential 
rectangle centered at the 
origin. 


dAg dAy 

curU A = [92] 

dy dz 

In an analogous fashion, or through simply 
advancing the cyclic order of the letters a*, y, s, 
the y- and 2;-comp)onents of curl A are found to be 


curly .4 = 


dAg 

dz 


dAg 

dx 


[93] 


curb A 


dAy 

dx 


dAg 

dy 


[94] 


Comparing the turbxilent with the potential field again, it is interesting 
to observe the significance of the conditions under which 



•ds = 0 


[95] 
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According to the discussion in Art. 5, these conditions are stated by 

^ - grad U [96] 


or 


Now since 


. dU ^ dU , dU 
dx ^ dy dz 


d^U d-U 


* > etc. 


[97] 


[98] 


dx dy dy dx 

it follows that the conditions 97 may alternatively be stated in the form 


dA^ 

dy 

dA^ 

dz 

dx 


dz 

[99] 

o 

II 

[100] 

o 

II 

[101] 


The expressions on the left-hand sides of these equations are the com- 
ponents of curl A given in Eqs. 92, 93, and 94. Hence it is seen that 

curl A = 0 [102] 

becomes the necessary and sufbeient condition for the vanishing of the 
line integral as expressed by Eq. 95. 

If over a given region the curl of A vanishes at all points, the line 
integral of A for any closed path within this region vanishes also,* and A 
may there be represented as the gradient of a scalar potential function 
as expressed by Eq. 96. Conversely, if the vector function A is the gradient 
of' a scalar, its curl must be zero; that is, A must be nonturbulent. The 
gradient then, has no curl. In symbols, 

curl grad U ^ 0 [103] 

This statement is verified through substituting the relations 97 into the 
Eqs. 99, 100, and 101. 

It is useful to observe that the components of curl A given by Eqs. 92, 
93, and 94 may be combined into a single compact vector expression by 
means of the determinant form 

i j k 

^ A d d d 

curl A = — V "T 

dx dy dz 

Ax Ay Az 

*Tliis statement is subject only to the restrictions pointed out in Art. 14> which need not 
be considered at the moment. 
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The Laplace development of this determinant for the elements of its 
first row is readily seen to yield 

i curlx A 4- j curly A + k curU A [105] 

in which the respective comp)onents arc the expressions 92, 93, and 94. 

By recalling the determinant form 25 for the vector product and the 
definition of the Hamiltonian operator V as given by Eq. 45, it is recog- 
nized that an alternative compact form for curl A reads 

curl A = V ^ A [106] 

In summary, the three important vector operations, gradient, diver- 
gence, and curl, are seen to be expressible in terms of the Hamiltonian 
operator, thus 

grad U =VU 

div A = N ' A [107] 

curl A = N ^ A 


11. Stokes’s law 

According to the definition of the curl as expressed by Eq. 80 or in 
component form by Eq. 89, it follows that 

^ A ■ ds = J * (curl A) ' da, [108] 

any clasefl contour any surface l>ounde<I 
by that contour 

This result, which is known as Stokes’s law, formally represents the 
transformation of a closed line integral into a surface integral, or vice 
versa. Stokes’s law is the counterpart of Gauss’s law in the sense that it 
expresses for the purely rotational field what Gauss’s law expresses with 
reference to the potential field. 

It should be observed that the closed contour in Eq. 108 is not restricted 
to lie in a plane, and that the surface bounded by this contour may have 
any shape. For example, the contour may be visualized as a warped hoop 
and the surface as that of a rubber membrane bounded by the hoop but 
allowed to be stretched out sideways into any form whatever. If the 
hoop is thought of as being moderate in size, one may consider blowing 
the rubber membrane out sideways until it becomes inflated like a 
balloon, which may bulge backward over the ring, etc. 

The vector surface increment da, which is normal to the surface, points 
in that direction which is determined from the direction of traversal of 
the boundary by the right-hand screw rule. This correlation is shown in 
Fig. 11. In Fig. 12 the surface is for convenience drawn as a plane. The 
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scalar product (curl ^4) • da equals the line integral of A for the closed 
boundary of any one of the surface elements. The surface integral repre- 
sents the sum of all these elemental closed line integrals. This sum 




Fig. 11. Correlation of contour traversiil with Fig. 12. The closed line 

element traversal when bounded surface does not integral equals the sum of 

lie in a plane. the elemental line integrals. 


evidently equals the line integral around the large boundary because the 
common boundaries of the surface elements are traversed in both direc- 
tions and their contribution to the surface integral is zero. Thus the 
validity of Stokes’s law, as stated by Eq. 108, is established. 


12 . The vortex distribution of a turbulent field 

For the turbulent held, it is convenient to define a x'OTtex density J by 
means of the relation 

curl A — J [109] 

in which J is analogous to the current density occurring in the study of 
electricity and magnetism.* Stokes’s law, Eq. *08, then yields 

ds = f-f da. [110] 

closed contour enclosed surface 

Equations 109 and 110 are analogous to Eqs. 63 and 64. They relate the 
turbulent field to its vortex distribution J just as the potential field is 
related to its source distribution p. The vortex density J is the cause of 
the turbulent field just as the source density p is the cause of a potential 
field. 

The essential difference is that p is a scalar function of the space co- 
ordinates whereas 7 is a vector function. In general, / is a finite and 

*In this analogy the vector function A represents the magnetic field intensity and should 
not be confused with the magnetic vector potential which is customarily denoted by the 
letter A, 
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continuous function; that is, the vortexes are thou^^ht of as continuously 
distributed over certain regions just as the source density p is in general 
continuously distributed. In certain practical problems, however, it is 
convenient to idealize the vortex distribution. The situation is again 
analogous to the idealized source distributions discussed in Art. 8, except 
that there is no vortex analogue of the point source.* 

There is, however, a vortex analogue of the lilamental source distri- 
bution. It is stx)ken of as a vortex thread. The amount I of the vortex 
thread is defined by the relation 

I = Jda [111] 

in which da is written in place of the actual finite cross-section of the 
filament. Since I is finite and da dilTerential, this idealization requires 
the concept of an infinite vortex density 7. The latter, as well as the 
vector /, is ever>*where directed tangentially to the filament. The vortex 
thread may be thought of as a filament along which are concentrated 
whirlpools of infinite intensity, and about which a fluid is set into a 
swirling motion. The central portion of a tornado or “ twister, ’’ and that 
of a " waterspout,’’ are hydrodynamic examples permitting an idealized 
representation by means of a vortex thread. 

A surface distribution of vortexes may be thought of as the result of 
many threads placed side by side like the waq) in a weaver’s loom. The 
vector moment I has the same value and direction for all the threads. 
A surface density g may here be defined by the relation 

g - J ds [112] 

in which ds represents the thickness of the surface or layer. There is no 
whirling action around the individual threads, that is, through the sur- 
face, because of the cancellation of this action along adjacent sides of the 
threads. The vortex surface merely causes a translatory motion of fluid 
in opposite directions along its tw^o sides. Thus the tangential component 
of the vector field A is observed to change suddenly as the point of 
observation is shifted from one side of the vortex surface to the other at 
any given point. 

Figure 13 shows the vortex surface in cross-section, the cut being 
parallel to the tangential components of A and at right angles to the 
direction of g. The line integral in Eq. 110 is evaluated for the closed 
rectangular path with the differential sides ds and dL Here ds is assumed 
to be so small compared to d! that the corresponding contributions to 
the line integral are negligible in comparison with those for the sides dL 

•In this connection it may be observed that one does have a vortex analogue of the double 
point source (the doublet or dipole), namely, the vortex ring. 
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Equation 1 10 then yields 

{Ati “ At2)df = Ids df = gdf [113] 

in which Ati and At2 arc the tangential components of A on the two 
sides of the surface. 

By analogy to Eq. 109, it is then possible to 
define a surface curl per unit length (in the df di- 
rection), given by 

curl, id = g = An - A, 2 [114] 

This vector points upward from the surface of the 
paper (the assumed direction of the vortex density 
g) in Fig. 13. If W 12 denotes a unit vector normal 
to the vortex surface pointing from side 1 to side 2, 
the direction of the surface curl is contained in the 
right-hand screw rule for the vector product in the 
delinition 

curU ^'1 = g = (Ail — -4^2) ^ «i2 [115] 


I 

r|' 


vortex 
surface - 


^closed 

path 


-ds 


Fig. 13. A vortex 
surface in cross 
section. 


13. The vpxtor potential function associated with a given 
VORTEX distribution 


The present article seeks to show that the vortex distribution of a 
turbulent field determines a vector potential function in a manner 
similar, except for certain anti-parallelisms, to that relating a scalar 
potential function to the source distribution of a potential field, as dis- 
cussed in Art. 9. 

The first step in this argument is to obser\"e that if Stokes’s law, Eq. 
108, is applied to a closed surface, the line integr d on the left-hand side of 
this equation is zero. This fact may be \Tsualize‘ 1 through first considering 
the surface to be balloon shaped and the closed contour or boundary to 
be the small loop located at the throat through w^hich the balloon is 
inflated. As this loop is contracted until the balloon is finally tied off, the 
closed contour shrinks to zero and the line integral vanishes. 

According to Gauss’s law, Eq. 62, it foUow's, therefore, that 


^ (curl A) • da = J* div (curl A) dv = 0 


closed surface 


enclosed volume 


[ 116 ] 


Since the size of the enclosed volume is arbitrary, it may be considered 
to be infinitesimal; whence this result yields 


div curl A = 0 


[ 117 ] 
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The conclusion is that the vector field given by the function curl A is 
source-free; in other words, curl ^ is a purely turbulent vector field. This 
conclusion is true for any vector function A, The relation 117 is an 
identity. 

The curl then, may be said to have no divergence. This statement is the 
complement of the one made in Art. 10 to the effect that the gradient has 
no curl. Whereas the gradient function represents a pure potential field, 
the curl represents a purely turbulent field. 

The identity 117 may readily be checked independently through 
substituting the components of the curl given by Eqs. 92, 9vS, and 94 for 
the components Aj, Ay, A ^ in the expression for the divergence as given 
by Eq. 

If a given vector function B is knowm to represent a purely turbulent 
field, that is, if 

div ^ = 0 [118] 

the result just obtained permits the function B to be e.xpressed as 

5 = curl A [119] 

the thought being that the vector function A is thereby specified in terms 
of J5, or vice versa. Here it must be observed, however, that whereas B 
is uniquely determined in terms of A by means of Eq. 119, the reverse 
is not true. 

This operation is readily appreciated from the fact that if the vector 
function A is assumed to be f)erfectly general, it may represent a field 
which exhibits both potential as well as turbulent characteristics. In 
other words, A may be represented by 

A = T+P [120] 

in which T is the source-free (or turbulent) component of A, and P is its 
curl-free (or potential) component, that is 

divr = 0 [121] 

and 

curl P = 0 [122] 

Substituting the expression 120 for A into Eq. 119 yields 

B = curl T [123] 

because of the relation 122. 

It thus becomes clear that Eq. 119 alone does not determine the whole 
function A, but only its source-free component. If the vector function A 
is introduced by Eq. 119 merely for convenience, that is, as an auxiliary 
function in the course of an analysis, this difficulty may be overcome by 
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the further demand that 

div = 0 [124] 

The vector function A is then uniquely characterized in terms of B by 
means of the two relations 119 and 124. 

This situation is similar to having given a purely potential field F, 
and assuming that a scalar potential function U may thereby be defined 
in terms of the relation 

F = — grad U [125] 

In this case, U is determined only within an additive arbitrary constant, 
since the gradient of the latter is zero. Just so, Eq. 1 19 determines A only 
within an additive arbitrary vector function representing a potential 
field, since the curl of the latter is zero. Setting the constant component 
of 6'' equal to some chosen value (for example, zero) is analogous to 
setting the divergence of A equal to zero in the present discussion.* 

It may now be supix)sed that the given source-free field represented by 
B has a vortex distribution described by the vector function J\ that is 

curl B = J [126] 

From Eq. 119 it then follows that a vector function A is related to this 
vortex distribution by means of the equation 

curl curl A = J [127] 

subject to the condition expressed by Eq. 124. 

The function A which is thus related to a given vortex distribution J 
by means of Eqs. 124 and 127 is called the vector potential associated with 
B, because it plays the same role relative to B and its vortexes that the 
scalar potential U plays relative to the associated nonturbulent field F 
aiid its sources. If the vector potential A is found from a given vortex 
distribution J by means of Eqs. 124 and 127 the turbulent field B is 
determined from Eq. 119. This method of determining B from J, al- 
though apparently roundabout, is frequently found to be more con- 
venient than that of attempting a solution directly in terms of Eq. 126. 

It is necessary to interpret further the repeated curl operation in Eq. 
127, as is readily done by means of Eqs. 92, 93, and 94 for the components 
of the curl. Thus the a-component of curl curl A is seen to be given by 

d /dAy _ _ d /dA^ _ ^ 

dy \ dx dy / d 2 \ dz dx ) 

dxdy d/ d 2 * dxdz 

*Any other disposition of the value of div A , appropriate to the circumstances pertaining 
to a specific problem, may likewise be made. 
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^2 A 

Adding and subtracting from this expression the term — ir » and ap- 

dx" 

propriately grouping the result, show that 
curl, (curl A) ^ 

dx\dx dy dz / 




By means of Eqs. 60 and 74, this may be written 


curlj. (curl A) = — (div/i) — 
dX 


[130] 


in which V* is the Laplacian operator defined by Eq. 73. 

The y- and c-components of curl curl A are found in a like manner, or 
more simply through advancing the cyclic order of the variables .v, y, z in 
Eq. 129. The result is 

( d S \ 

^ ^ ^ ^ d ) ^ 

— “1“ j^y 4" kA,) [131] 

which is 

curl curl A = grad div A — div grad A [132] 

or 

curl curl A -= r(r • /I) - V'A [133] 

It is interesting that although the gradient of the vector function A is 
not defined,* the Laplacian of Ay or div grad A, is interpreted according 
to Eq. 131 as 

div grad A = VM = i + j V^Aj, 4- k V^A^ [134] 

Equation 127, subject to the condition 124, is now seen to yield the 
equation 

VM = [135] 

relating the vector potential associated with B to its vortex distribution. 
The nicety of this result is that it is identical in form to Eq. 75 relating 
the scalar potential U to the source distribution p of a nonturbulent field. 
The only significant difference is that Eq. 135 involves vector functions 
whereas Eq. 75 involves scalar functions. This fact means that the vector 
equation 135 is equivalent to three scalar equations of like form, one for 
each of the three corresponding components of A and J. 

•An interpretation of the gradient applicable also to vector functions is given in Art. IS 
of this chapter. 
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14. The possibility of a multivalued potential function 

It has been shown that if, throughout a given region, a vector field A 
is conservative so that curl A is identically zero at all points within this 
region, A may there be expressed as the gradient of a scalar potential 
function U . Furthermore, it has been pointed out that the existence of such 
a potential field is in general due to a distribution of sources which, ac- 
cording to the hydrodynamic analog>% are fluid-producing regions. One 
may be tempted to conclude, therefore, that a potential field can be due 
only to sources, and never to vortexes, because these latter are regions of 
turbulence and hence can produce only turbulent fields. 

A moment’s reflection, however, yields the thought that if a distribu- 
tion of vortex density J is confined to a finite region of space, only within 
that region is the curl of the associated vector function A different from 
zero. Everywhere else the curl of A is zero, and the field is, therefore, 
potential (that is, conscr\*ativc) in character. 

A very imjx)rtant distinction, however, should be made between the 
conscr\’ative field which is due only to pure sources, and that which is due 
to vortex distributions restricted to certain excluded regions of space. To 
state the matter in another way, it is important to distinguish between a 
field which is conserv'ative throughout all space and one which is conserva- 
tive only within certain portions of space, or to the exclusion of certain 
portions. The field which is conserv^ative everywhere can have its origin 
in true sources only, whereas the one which is conservative within re- 
served regions (finite or infinite in extent) can be due to vortexes as well 
as to sources, although the former are confined to lie outside the reserved 
regions. 

This distinction is concerned primarily with the nature of the associ- 
ated scalar potential function U. Thus, if the field is conserv^ative 
throughout all space, or throughout a simply connected region b (see 
Fig. 14'), the associated potential function U :here is single valued; but 
if the region over which the field is conservative is a multiply connected 
one (Fig. 15), the associated potential becomes a multivalued function. 

A region is said to be simply connected if for rcery closed contour lying 
entirely within the region a surface bounded only by that contour can be 
constructed such that every point of the surface also lies within the region. 
The significance of this statement is best understood from an example of a 
region which is not simply connected. Thus the space outside a doughnut- 
shaped region is not simply connected because a closed contour which 
links the doughnut (passes through the hole) cannot form the sole 
boundary of a surface every point of which is required to lie outside the 
doughnut. If the doughnut is cut so that it no longer forms a closed ring, 
the space becomes simply connected. With the doughnut intact, the 
surrounding space is said to be doubly connected. 
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Another illustration of a doubly connected space is that surrounding a 
closed cylindrical surface of infinite extent like a straight tube which is 
infinitely long in both directions. Here a contour which encloses the 
cylinder cannot form the sole boundary of a surface which lies wholly 





any plane croes-eection 
of a aimply connected 
region must be a cloeed 
curve as in it must 
not cross itself as in “b'.’ 


Fig. 14. A simply connected region. 


outside the cylinder. If the space enclosed by the cylindrical surface is 
filled with vortexes whose cross-sectional distribution is given by the 
vector density J (directed longitudinally), then, since by assumption 
/ = 0 outside the cylinder, the associated field function A satisfies the 
equations 

cwAA = J [136] 


within the cylinder, and 

curl A = 0 [137] 

outside the cylinder. 

In the region outside the cylinder it is, therefore, possible to assume 

A = - grad U [138] 
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connecting canal 
or tube 



connecting tube d can be used instead of a but not both at the same time, 
the connecting tubra, of infinitesimal cross-section^ are the 
so-called barriers of the region 

Fig. 15. A multiply connected region. 


With reference to Fig. 16, which shows a cross-section through the 
cylinder, the line integral of A formed for the closed path Px is evidently 
zero because it does not enclose the vortex region. This fact follows 
readily from Stokes's law. However, for the closed path P^ the same law 
yields 

(f A 

JPi 

But by Eq. 138 

i/ 



and hence it follows that 

(grad U) • ds = — / 

Pi 


[141] 


If the line integral is thought of as extending from some point a on the 
path P2 around the closed contour back to the same point a, this result 
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indicates that 

£ (grad U) • ds = t/a - = / [142] 

in which is the value of the potential U at the point a. The only 
possible conclusion to be drawn from Eq. 142 is that the potential func- 
tion U is not uniquely delined for points in the space surrounding the 



Fig. 16 . One path of integration which encloses a vortex; another which docs not. 


region enclosed by the cylindrical surface. Indeed, if the line integral is 
extended around the contour Po, the result is 

r. - = 21 [143] 

and for n circuitations 

Ua - Ua = nl [144] 

The potential U at any point such as a clearly is defined only within an 
additive integer multiple of I. This is the nature of the multivalued 
potential function U in this example. 

The structure of the space surrounding the cylindrical region may be 
thought of as composed of leaves lying in the cross-sectional plane and 
winding about the cylinder as a winding stairway encircles a column. 
The density of these leaves longitudinally is infinite so that the process of 
encircling the cylinder any finite number of times around the contour 
P 2 without piercing any leaves involves no net longitudinal motion. Any 
point such as a is then referred to as Ui, a 2 , ^ 3 , • • • etc., according to an 
assumed numbering of these hypothetical leaves, and Eq. 144 may then 
be written more precisely in the form 

u.,.. - = nl [145] 

Although the multivalued potential function U may, through such a 
mathematical artifice, be transformed into a single-valued one, the rather 
complicated interpretation of the space outside the cylindrical region is 
given here principally for the purpose of lending visual clarity to the 
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concept of multivalucdncss. A practical problem involving these concepts 
arises in the subject of magnetostatics where the cylindrical region is 
identified with a current-carrying conductor, the density of the electric 
current being /, whereas the total conductor current has the value /. 


15 . The differentiation of scalar or vector functions with 

RESPECT TO THE TIME 


A given vector function A is considered to be a function of the space 
co-ordinates x, 3 ;, 2 and also of the time /, that is, 

A = A (x,y, 2 ,/) [146] 

The behavior of this function is studied at some point P in space, having 
the co-ordinates x, y, and z. More specifically, this point is denoted by 
P{x\\\z). This notation may refer to any specific point if proper values are 
assigned to x, y, and z. P is also referred to as ‘‘ the point of observation.’^ 
In considering the time derivative of A at the point P, or the rate at 
which the vector A changes with respect to the time at the point of 
obser\'ation, two possibilities must be distinguished, according to whether 
the ix)int of observ'ation is stationary or is in motion relative to the refer- 
ence co-ordinate system. In the former case, the variables x, y, and 2 are 
constant, and in the latter they are functions of the time — that is, the 
point P moves with a velocity v given by 

V = ivx +jvy kvz [147] 


with 



dy 


dt 



[148] 


If the point P is stationary (x, y, z are assumed constant), the time 
derivative of A is understood to be the partial derivative dA/dt\ if P is 
not stationary, the time derivative of A is given by the total derivative 
dAjdt, 

Identical remarks apply to the time derivative of a scalar function £/, 
which may, for example, be the potential energy of a particle located at 
the point P. If the particle is stationary, the potential energy may change 
with the time because of a time variation of the field in which it is em- 
bedded. This possibility is denoted by the partial derivative dU jbL If 
the particle is in motion, the net rate of change of its potential energy 
with respect to the time is denoted by dUldt, and is in general due to the 
combined effects of the time variation of the field in which the particle 
is embedded and of the particle’s own motion through this field. 

Neglecting differential effects of higher order than the first, the total 
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time derivative of the scalar function U is expressed by 
dU dU dUdx dUdv dUdz 

dt dt dx di dy dt dz dl ^ 

The first term on the right-hand side of this equation represents the time 
variation of U on the assumption that the particle at the point P is 
stationary. The sum of the remaining three terms represents the time 
variation of U due solely to the motion of the particle through the field 
which (as far as these terms are concerned) is constant at any fixed 
point but varies from point to point. The net time variation dUldl 
appears as a linear superposition of these two separate effects because 
the contribution of higher order derivatives may (according to the 
principles of the differential calculus) be neglected. 

According to the definitions of the gradient and the scalar product, 
Eq. 149 may be written in the form 

dU dU , , 

— = — + -grad:/ [150] 


in which the velocity v of the point of observation is defined by Eqs. 147 
and 148. If U is not an explicit function of the time - that is, if U is a 
function of the space co-ordinates alone — dU dt — 0, and dU dt is 
given by the second term in Eq. LSO alone. This term equals the com- 
ponent of grad U in the direction of multiplied by the magnitude of v. 
In other words this term equals the space rate of change of U in the direc- 
tion of i’, multiplied by the magnitude of v. It is convenient to regard this 
as a resultant op>eration upon the function U by defining the operator 


(i/-grad) = ('a - V) 
and by writing Eq. 150 




dt 


{^+ivV)U 


d 


dz 


[151] 


[152] 


The vector v in the operator defined by Eq. 151 may be replaced by any 
vector function B, and {B • V) is then referred to as the “.5-gradient.’^ 
The symbolic operation (5- V)U is read: “The 5-gradient of UP 
Unlike the gradient of U, it is a scalar quantity. 

The convenience resulting from defining this operator is seen when the 
operator is used in connection with a vector function A. Whereas the 
gradient of a vector function cannot be interpreted according to the 
definition of this vector operation, the 5-gradient is readily seen to be 
applicable to vector functions, and yields a vector, thus 

{B ■ V)A = HB • vMx +jiB ■ V)Ay + k{,B ■ VM, [153] 
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in which the A:-component reads 

= = + + [154] 

bx ^ by dz 


Similar expressions apply to the y- and s-components. 

'Fhe total variation of a vector function A with the time is, therefore, 
given by the expression 


dA __ bA 
dt “ bt 


{V • V)A 


[155] 


This is, in general, a function of the time. At any given instant, the vector 
(V • V)A represents the space rate of change of A in the direction of v, 
multiplied by the magnitude of ?. To state the case in another way, 
(? • V).'l is the vector change (increment) of A per unit displacement of 
the ix)int of observation in the direction of v, multiplied by the magnitude 
of 

A purely algebraic interpretation of the operation (v • V)A is obtained 
by letting 

(t* • v)yl = G = iGx “1“ jGy + kGz 

Then 


^Ar 

bx 



+ 





bA„^ .OAy^ 
bx by 



[157] 


<7/1, 

d.V 




The operation is thus recognized as amounting to a linear transformation 
of the vector v. If for abbreviation one writes 


ajrx = 


dAx 


dry — 


a/1, 


dx *’ dy 
then the matrix of the transformation is 


a/1, 



[158] 


a = 


dzz 

dyx 

Jdzx 


d-xy d-xz 
dyy dyz 
dzy dzz^ 


[159] 


The operation {v • V)A is thus regarded as a tensor of valence 2, which 
associates a vector G with a given vector v at any point in space. The 
elements of the matrix (i, defined by Eq. 158, are the components of this 
tensor. Their values depend upon the vector function i4 . G is said to be a 
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linear vector function of ?, the characteristics of this relationship being 
determined by the nature of A . 

In passing it may be of interest to note that the operations of forming 
the curl or a vector product may also be regarded as linear vector trans- 
formations. Thus the vector, curl A, is expressible as a linear transforma- 
tion of A with the skew-symmetrical operator matrix 


0 

d 

dz 



(9 r9 

dy dx 


0 


[ 160 ] 


and the vector product /I 5 is expressible as a linear transformation of B 
with the skew-symmetrical matrix 


0 -A^ 

A, 0 

Ay Ax 



[ 161 ] 


The time derivative of the product of two vector functions is evaluated 
according to the familiar rule for the dilTerentiation of a product. For the 
scalar product, the result reads 


dl 


(A 




dB 

dt 


[ 162 ] 


and similarly for the vector profiuct 




dA 

dt 


B + A 


dB 

~dt 


[ 163 ] 


in which it is important to preserve the order of the two functions. 


16 . Additional useful vector relations 

Although the discussions of the preceding articles suffice to determine 
the results of more complicated operations involving combinations of 
scalar and vector functions, it is useful to have available several additional 
formulas covering those relationships which are encountered more fre- 
quently in practical problems. The following items are concerned with 
derivations of this sort. 

(i) The divergence of the prodiict of a scalar and a vector. 

The scalar function is denoted by U and the vector function by A. 
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Then 

div {UA) = y^ {UA,) + ^ {UAy) + ^ {UA,) [164] 

By the rule for the clifTerentiation of a product, this is seen to yield 

div {UA) = U div A + A • grad U [165] 

(ii) The curl of the product of a scalar and a vector. 

Here the rule for the differentiation of a product may be expressed in 
the following way : 

curl {UA) = V^{UA) = V-{UA)u + {UA)a [166] 

in which the subscripts mean that the functions U or A are to be treated 
as constants. According to the definition of the curl by Eq. 104 it is 
readily seen that 

V-{UA)v = U curl A [167] 

and 

V = —A xgrad U [168] 

Hence 

curl {UA) = U curl A A ^grad U [169] 

(iii) The divergence of a vector product. 

This may be written 

div [yl X B] = V • [^1 X B] = V • [/I X S ].4 + V • [^ X B^b [170] 

the subscripts having the same significance as before. From Eqs. 26 and 

38, 

V . U X B]a = • [V X B] = -A • curl B [171] 

V . [yl X 5]^ = B . [V X ^i] = ^ . curl A [172] 

so that 

div [^ X B] = B • curl A A • curl B [173] 

(iv) The curl of a vector product. 

Here 

curl [id X B] = V X [^ X B] = V X [^ X B]^ + V x [^ x B]^ [174] 

Applying Eq. 32 for the triple vector product gives 

Vx[^xBL4 =A{V‘B) - {A-V)B [175] 

and 

V X [^ X B]/, = (B • V)A - B{V • A) [176] 
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The operations yl -gradient and /^-gradient arc defined by Eq. 151 in the 
previous article. Thus 

curl [.1 - B] = A div B - B div • V)A - {A • V)B [177] 

(v) The vector product of a vector ami the cur! of another vector. 

Here Eq. 32 is applied to the triple vector product 

.1 >curl B = A [178] 

the vector function A being treated as a constant. Thus one obtains 

A-[N-B] = V(A - B)a - (A • V)B [179] 

or 

A ^ curl B = grad (/I • B)a — (A • V)B [180] 

(vi) The gradient of a scalar product. 

This reads 

grad (^1 • B) = grad (A • B)a + grad (.4 • B)ii [181] 

By Eq. 180, 

grad (.1 • B)a = -1 ^ curl B -f (J • V)B [182] 

grad (A • B)ji = B^ curl .1 -r {B • Vj.l [183] 

Hence 

grad {A • B) = A curl B + B ^ curl A + (A • V)B + (B • V).l [184] 

fvii) The volume integral of the scalar product of a potential and a solcnoidal 
vector function. 

A purely potential field is described by the vector function /I, and a 
purely turbulent field by the vector B\ that is, 

curl = 0 [185] 

and 

div B = 0 [186] 

throughout all space occupied by these fields. In the volume integral 

f (A - B) dv 1 187] 

which is extended over all space, or over that portion to which the field 
B may be confined, the element of volume dv is represented by an ele- 
mentary length ds of any one of the closed tubes characterizing the flow 
map for the solenoidal field B. If the cross-sectional area of a flow tube 
is denoted by da, the integral 187 may be written 

f{A B)dv = J f \B\ da {A ■ ds) [188] 
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Here the vector element of length ds of a flow tube has the same direction 
as B. The integration with respect to ds extends around the closed circuit 
of a flow tube, and the integration with respect to da extends over all the 
flow tubes. 

According to the definition of a flow tube, \B\ da is constant through- 
out the closed circuit mapped by this tube, so that this factor may be 
placed before the integral with respect to ds. Hence 


J {A ■ B) dv = J l5| da • ds 

[189] 

Since A defines a potential field, it follows that 


• ds = 0 

[190] 

so that the final result reads 


f (A- B) dv = 0 

[191] 



17. The vector r 

In connection with many field problems it is convenient to introduce 
a vector r which represents the vector distance between two points P 
and Q in space as shown in Fig. 17. The {X)int Q may be the location of a 
cause (such as a source or vortex) whereas P is the point at which the 
resulting field is observed. 'Fhe vector r is assumed to point from Q to P. 
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Hence if the co-ordinates of Q are denoted by f , and those of P by 

X, y, s, the components of r are given by 

Tx = (^ - i). ry = {y - I?), ry= {z- f) [192] 
and the vector r is 

r = i(x - f) +j(y - It) + k(z - f) [193] 

The magnitude of r is expressed by the scalar function 

r = V {x - + {y - v)'^ + (s - [194] 

In discussing various operations in terms of either the vector r or the 
scalar function r, a distinction must frequently be made according to 
which of the points P or Q is considered to be variable. If Q is foced and P 
variable, the operations of differentiation or integration apply to the 
co-ordinates x, y, z. This state of affairs may be indicated by attaching a 
subscript P to the operator in question. Similarly, a subscript () indicates 
that P is considered fixed, and the variables {, 77, f, are affected by the 
operator. For example, 

gradp = »£;+y^ + ^^ [195] 

whereas 

Similar distinctions apply to the operator div and curl. 

Because of the symmetrical manner in which the variables x, y, z on the 
one hand and J, 77 , f on the other, enter into the expressions 192 for the 
components of r, the difference between a given operation as evaluated 
for the subscripts P and Q is readily recognized. Thus, for the o] 3 erations, 
grad, div, curl, for example, the result for a subscript P is simply the 
negative of that for a subscript Q. 

In the following more specific discussion, therefore, the variables are 
always assumed to be x, y, 2 . Without loss in the generality of this dis- 
cussion, the fixed point Q may then be assumed to be coincident with the 
origin of co-ordinates. This simplifies the expressions 192, 193, and 194 to 

rx = X, ry = y, Tz =- z [197] 

T==ix+jy + kz [198] 


and 


r = + 372^^2 


[199] 
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In terms of these expressions it is readily seen that 

dT Qt Qt 1 r 

gradf = +i- + = - {ix +jy + kz) = - [200] 

In other words, the gradient of the scalar function r is equal to a unit 
vector in the direction r. 

For the evaluation of the gradient of a function of r, it is useful to 
observe that 


^ dr\dx '^dy dz/ 


Hence if /(r) denotes a function of r, and the partial derivative df/dr is 
written /(r), 


For example, if 


grad/(r) = fir) grad r = / (r) ^ 


Sir) = r» 


grad r" = [204] 

An application of this result to a commonly occiuring form reads 


According to Eq. 60, the divergence of the vector function r is 

divr = — + ^ + — 
dx dy dz 


and with Eq. 197 this gives 


divr = 3 


From Eq. 165 it follows that 

div (r"r) = r" div r + r • grad r" 
By substitution from Eqs. 204 and 207, this yields 
div (r"r) = (h + 3)r’‘ 


This result, together with Eq. 204, shows that 

div grad r" = W = n(« + l)f"-2 [210] 

In particular, for » = - 1, this gives the important result 



[ 211 ] 
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Evaluating the curl of the vector function r by substituting the com- 
ponents 197 into Eq. 104 shows that 

curlr = 0 [212] 

and the use of Eq. 1 66 then shows further that 

curl (r”r) = 0 [213] 

In particular, 

curl = 0 [214] 

The vector function r, or (r^'r), may be looked upon as defining a pure 
potential field. 

A number of vector relations formed from the function r in combination 
with an arbitrar>" vector function A are also useful. Using the result 
expressed by Eq. 184 together with Eq. 212 yields 

grad (r • A ) = r x curl A + (r • V)A -f {A • V)r [215] 

But 

+ = 4 [ 216 ] 

SO that 

grad (r • A) = r ^ curl A + (r • V)A + A [217] 

Equations 173 and 212 show that 

div [r ^ A] = — r • curl A [218] 

Whereas Eqs. 177, 207, and 216 yield 

curl A] = r div A — 2A — (r • V)A [219] 

18. Curvilinear co-ordinates 

Because of the geometry inherent in some physical problems, it is 
advantageous to designate various points in space in terms of co-ordi- 
nates other than the rectangular Cartesian ones which have been used in 
this discussion so far. In a problem which exhibits cylindrical symmetry, 
for example, it is usually effective to use cylindrical co-ordinates; if the 
physical problem has spherical symmetry, spherical polar co-ordinates 
are usually preferable, and so on. 

The variables occurring in connection with any of these curvilinear 
co-ordinate systems may be denoted by m, w, as contrasted with the 
variables x, y, z, used for the rectangular Cartesian system. In the 
Cartesian system the curves for constant values of y and s, z and a:, and 
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X and y form mutually orthogonal families (which in this instance are 
straight lines). For this reason, the rectangular Cartesian co-ordinates 
are said to be orthogonal. 

In a curvilinear co-ordinate system, the curv^es for constant values of 
V and w, w and u, and u and v may also form mutually orthogonal fami- 
lies, for example, in the case of cylindrical or spherical polar co-ordinates. 
The curvilinear co-ordinates are then said to be orthogonal. The dis- 
cussion of the present article is restricted to systems of this sort since the 
need for a more general system very rarely arises in practice. Moreover, 
the mathematical apparatus necessary for dealing with perfectly general 
curvilinear co-ordinates - the absolute differential calculus — requires a 
detailed study of considerable depth, which seems warranted only when 
suflicient use for it in connection with other practical problems presents 
itself. 

For the orthogonal curvilinear co-ordinates, a set of mutually orthogo- 
nal unit vectors are defined which are analogous to the vectors 

ij, k in the Cartesian system. The unit vector at any point in space is 
tangent to the curve v = constant, 'w = constant, at that point. Simi- 
larly, is tangent to the curv'e lu = constant, u = constant, and is 
tangent to the curve u = constant, v = constant. The directions are, 
moreover, so chosen that i,,. form a right-hand system. 

It should be observed at this point that the scale of length which is 
implied in the designation of /v, as unit vectors is that pertaining 
to the rectangular Cartesian co-ordinate system. The scales for the 
curvilinear co-ordinate axes w, ?, w at any point are in general different 
from this scale of length, and are moreover a function of the position of 
the point in space. In other words, the magnitudes of the increments 
duy dv, dw as measured in the Cartesian co-ordinate system are 

dSn — Cn du 

dsv ^ e„dv [220] 

dsxc = dw 


in which Cu, are factors accounting for the differences in the scales 
for the Hy V, and w co-ordinate axes at the point in question and the scale 
of length of the Cartesian system. 

A given vector increment of length ds is expressed in the Cartesian 
co-ordinate system by 

ds ^ idx +j dy + k dz [221] 

and in the orthogonal curvilinear system by 

ds ~ dsyj^ “F ds\y “F ds \0 [ 222 ] 
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The length of the vector increment ds in terms of the Cartesian co- 
ordinates is determined by 

(ds)- = (dx)- H- (dy)- -f- (dz)- [223] 

whereas in terms of the curvilinear co-ordinates 

(dsY = (dsu)'^ + (ds,y- + (ds^)^ [224] 

or 

(ds)- = eY(dn)- + eYidv)^^ + eY(dw)'^ [225] 

it being understood that this length is measured by the scale of the 
Cartesian system. 

The scale factors Cu, c,,., which in general arc functions of the position 
of the point in question, that is, functions of .v. y, c or v, may for 
given curvilinear co-ordinate system be found in several ways. In simple 
cases the expressions for ds,^, ds^.^ and ds,^. may be written down l^y 
inspection; whence the scale factors according to Eqs. 220 become 
e\ident. A more formal method of determining them follows. 

For any particular curvilinear co-ordinate system, the co-ordinates 
X, y, z are expressible as functions of //, r, thus 

= /i (M.f.ii') [226] 

y = h{u,v,u’) [227] 

3 = faiUtViU') [228] 

According to the rules of the differential calculus, 

— dll -V ~ dv + — dw = dx 

dll dv dw 

— du -j- — dv + ^ dw = dy [229] 

du dv dw 

— du — dv + — dw dz 

du dv dw 

or 

^dsu + ^ dsv + ^dsu, = dx 

0^1^ dSy ^^10 

— + — ds„ + ^ds^=‘dy ' [230] 

— dsu + — dsy + — ds^ = dz 

ds%^ dSp ds UP 
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In terms of the transformation matrices 

r^/i 


^ ^ 

dll dv dw 

dU dj^ dj2 

dll dv dw 

^/a ^/a 

dll dv dw 


a/, 

9h 

dh 

dSu 

dSv 

dSu- 




dSu 

dSi. 

ds„- 



dh 

dSu 

dSv 

ds 


the relations 229 and 230 may be written in the matrix form 


dv = dv 


dsu dx 
dsr, = dy 
_^dstcj 


From Eqs. 220, it is clear that 
r dsu n r^u 


and hence that 


ds„ = 

dSuA 


rdii 
X dv 


fcu 0 0 

y = y J 0 e, 0 


Since both the Cartesian system and the curvilinear system are 
orthogonal, it follows (see Art. 4, Ch. Ill) that the matrix 7 a in the 
transformation 234 is orthogonal, and hence that 

{7.)i = 


[237] 
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the subscript t denoting the transposed matrix. But from Eq. 236 


~eu 0 0 

0 Cp 0 

_ 0 0 Cu, 


= 

and with Eqs. 236 and 237 this gives 

= 




~Cu- 0 0 

0 c/ 0 

_ 0 0 


Hence, by means of Eq. 231 it follows that 

(ST -(sr 
--(ST -(ST -(IT 
-(ST -(ST -(ST 

from which the scale factors are determined. 

As an illustration, the cylindrical co-ordinates 
may he considered, for which the r-axis is chosen 
coincident with the axis of cylindrical symmetry 
^ (Fig. 18). The relations 226, 227, and 228 then 
read 


[238] 

[239] 

[240] 

[241] 

[242] 



Fig. 18 . Cylindrical co- 
ordinates with the z-axis 
as axis of cylindrical 
s>Tnmetr>'. 


X = r cos <f> 
y = r sin <t> 
2 = 2 


[243] 

[244] 

[245] 


The variables r, (p, z are chosen to correspond 
respectively to u, v, w. Then Eqs. 240, 241, and 242 yield 

Cr^ = cos* <t> + sin* <) + 0 = 1 [246] 

= r* sin* <^> + r* cos* </> + 0 = r* [247] 

e,* = 0 + 0 + 1 = 1 [248] 

Hence 

Cr = I, = r, c* = 1 [249] 

are the scale factors for this case. The vector increment of length is 

ds = ir dr + i 4 ,rd<l> + it dz [250] 
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that is. 





dsr = dr, ds^ = rd<t>, 

dsz = dz 

[251] 

and 





(dsf = (dr)^ + r^{d4>Y 

+ {dzY 

[252] 


In this simple example, the last three results can, of course, be written 
down at once from inspection of Fig. 18. 



Fig. 19. Spherical polar coordinates. 


The geometrical relations between spherical polar and rectangular 
co-ordinates are illustrated in Fig. 19. Accordingly, Eqs. 226, 227, and 


228 become 

X = r sin Q cos 0 [253] 

y = r sin 0 sin 4> [254] 

s = r cos ^ [255] 

The variables r, 0, <t> are chosen to correspond respectively to w, w. 
Then Eqs. 240, 241, and 242 yield 

= sin- d cos- <t> + sin^ 6 sin^ <t> + cos^ ^ = I [256] 

ee^ = cos^ 6 cos^ <l> + cos^ $ sin^ <t> + sin^ 6 = [257] 


sin^ $ sin^ <l> + sin^ 6 cos^ <^ + 0 = sin^ 6 [258] 


Hence 


Cr = 1 , ea = f, €4, = f sin^ 


[259] 
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are the scale factors. The vector increment of length is* 
ds — ir dr + ur dd + sin 6 d<l> 

that is, 

dsr — dr y ds 9 = r dd ds^, = r sin 0 

and 

{ds)“ = (dr)^ -{- r^^idsY + r^ sin^0 (d4}Y 


[260] 

[261] 

[262] 


The most commonly used vector operations are the gradient, diver- 
gence, curl, and div grad, or T-. Formulation of these in terms of any 
orthogonal curvilinear co-ordinates «, u* and specifically for the cylin- 
drical and spherical polar co-ordinates are discussed in turn. 

The expression for the gradient of a scalar function U reads 


. dU dU dU 

grad U = tu- h iv — + tw — 

dSn dSy dS%jD 


[263] 


or 


ATT ■ 

grad U = *« — — + *„ 

Cu dU 


Cf) dv 


+ iw 




[264] 


With the help of Eqs. 249, this becomes, for cylindrical co-ordinates, 


ATT . \dU dU 

grad — 

dr r d<f) oz 


[265] 


and for spherical polar co-ordinates, substitution from Eqs. 259 gives 


, ,, , dU , \dU . 1 dU 

grad U tr — + t9~ —T trtr — r— : — 

dr r dd r sin d d4> 


[266] 


For the formulation of the general expression for the divergence of a 
vector function A, the definition given by Eq. 56 is applied to a curvi- 
linear parallelepiped (Fig. 20) enclosed by the surfaces u = constant, and 
u du = constant; v = constant, and v dv = constant; w = con- 
stant, and w + dw = constant. Except for differentials of higher order, 
the elementary areas of opposite faces of this parallelepiped are equal. 
Specifically, the elementary areas of the faces normal to iu, ivy iw are 
given respectively by 

don — dSy dSxff ~ ^xfiw dv div [267] 

da„ = dsy, dsu = e^eu dw du • [268] 

da,o = dSu dsv = du dv [269] 

this example, one may also write down the expression for ds directly and thus obtain 
the scale factors without applying the formal method. 
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The volume of the parallelepiped is 

iV = isv dsu, = euC^w du dv dw 
The surface integral of is given by 

A, * da — u “* t^uddulu “I” \jA ‘t}dQf^i}.^^-fj ^jA^dfd'^^ 

“1“ yu ddiffj w-j-dw ”” \.A VO ddiv^ V 



Fig. 20. A curvilinear parallelepiped formed by surfaces of constant u, v, and w. 

in which Au, Ay, A^ are the components of A in the directions of iu^ ivt iw 
respectively, and the subscripts indicate that the quantity enclosed by 
tjie brackets is to be evaluated at the points u + du, v, w; u, v, w; u, 
V + dv, w; u, V, w] etc. Now 

14 d(jir\ji^n.^^x^ ~ dfCLy^xi “f" ~ dfdry^v^du jr272j 

du 


so that 


14^14.^ 14 dci|4j|4 — \jA%^ dfCLy^xidU 

du 


= — \_Auer^vo\udu dv dw [273] 
du 


and so forth. Hence 


/ d d d 

^ . da = — (j4„CrC„) + — (i4,;C„Cu) + — dudvdw [274] 

du dv dw 
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SO that the general expression for the divergence becomes 


div A = 


^ ^ • da 

ds n ds i' ds lit 


{Au^vCic^ “h {AirCii-Cu^ -f" [“^'^1 

euCvCw dv dw 1 

In cylindrical co-ordinates, this reads 

div A = {Arr) + ^ (^4) + |- [276] 

r [dr dq> dZ 


1* ^ ^ ^ \ 1 dAz 

div^ = + — [2,1] 

In spherical polar co-ordinates, 




1 dAij, 

r sin d d<f> 


The expressions for the components of the curl of the vector function 
A are obtained through applying the defining relation 89 to elementary 
curvilinear rectangles lying in planes normal to the directions of 
and iu,. Except for differentials of higher order, the opposite sides of such 
a rectangle are equal. Considering the w-component, the sides of the 
rectangle are dsv = dv and dsw = dw. The line integral of A taken 
around this rectangle is expressed by 

A * ds “ u ds J to “f~ \_A Iff ds ipj V-^-dV V d'S yj XO-^-dtO to ds lyj y [^ 280 j 


[i4ty dSij^ff^dv — [<^t0 "1“ ^ [^10 ^'^lolv 

dv 


[A V ds^ iff^dw = [A V w + — \^vds^iffdw [282] 

dW 
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so that Eq. 280 becomes 

A * (is ~ w ds J ij dv ]i.A v ds J to d*w 

u OV OW 

or 

^ A • ds — ' ~ (^A u'^w^v ~~~ _ ~ ^A v^v^wf dv div 
I [dv dw J 

The enclosed area is ddu = dv dw. Hence by Eq. 89, 

1 f 5 d 

curltt A = 1“ (-^lu-Cu;) “ {AyCy^ 

Ci-Cu; [dv dw 

and similarly 

\ { d d 

CUrlf 1“ {AnCu^ 7 (-^ ir^io) 

Ca-Cu dli 

1 f* 5 d 

curl 14 - yl = IT” (AvC^;^ ~~ {A^Cu) 

CuCv [dll dv 

In cylindrical co-ordinates, these expressions become 

r d<t> dz 


cur4 A = 


dz dr 


curUyl = - — (ry^) — 

r dr r d<t> 

and in spherical polar co-ordinates, they read 

1 d 


curb A = 


r sin 0 dd 


curbyl = 


243 

[283] 

[284] 

[285] 

[286] 

[287] 

[288] 

[289] 

[290] 


li. 

r dr 


, . . . 1 3.4, 

(T«,sin(9) . 

r sm B d<t> 

[291] 

> r dr 

[292] 

f dd 

[293] 


The evaluation of V^U is obtained through substituting the components 
of grad U from Eq. 264 for the components of A in Eq. 275. This gives 
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For cylindrical co-ordinakSy this yields 




r dr \ dr ) ^ r^ d</>^ ^ dz^ 


[295] 


In spherical polar co-ordinates ^ 

1 a/. dU\ 1 d^u 

r“dr\ dr) r“sinddd\ ^ dd ) r“' sin^ d 


[296] 


The first term on the right-hand side of this equation may be written in 
other forms by noting that 


iA/ 2^^ _ ^ ^ 1 d-(rt/) 

r"dr\ dr) dr- ^ r dr r dr^ 


[297] 


PROBLEMS 

1. Let Ay By C hQ three vectors extending from a fixed point, O, to the points 

a, hy Cy respectively. Express the directed segments ah and ac in terms of /! , B, C. If b 
lies on the line ac and divides it in the ratio 

ah 

— =* m 
ac 

write the expression for B in terms of Ay Cy and m. Conversely, \{ B = ntA -f wC, 
where m and n are scalars such that w -f « = 1, show that b must lie on ac and find the 
ratio 

ah 

ac 

2. Let Ay By Cy D ht four vectors extending from a fixed point O to the points 
a, by Cy dy rospoc 1 1 voly . As in Prob. 1 , show that the necessary and sufficient conditions 
>ielding a, by c, d coplanar are expressed by the relations 

a -h /3 -1-7 = 1 

D — aA A- PB 4- yC 

in which a, y are scalars. 

3. The vectors from the origin to the points a, b, Cy d are 

A ~ i -\-j -f k 

B ^ il At 
C = *3 -h 7*5 - k2 
D ^ -j A- k 

Express the vectors aby bcy cdy da in terms of the unit vectors iyjy k. Show that ab and 
cd are parallel. 
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4. Prove that the absolute value or length of a vector A is given by 

A = VA- A 

Find the lengths of the vectors A , B, C, and D in Prob. 3. 

5. Evaluate the following scalar products: 

(i) (t3 +i4 + kl) • O’ - ^5) 

(ii) (t -\-j2 - ^3) • (t4 4-i + k2) 

(iii) (tSO + ;^35) • (-t3 - yi7 + k2) 

6. Verify Eq. 3, page 188, for the case where 

A = i2 jZ — k 
B = -i -i 
C = i — j2 
D = -i2 -hj +k3 

7. Find the cosine of the angle between the vectors 

A = i -j2 - k2 
B = i2 - k2 

8. The sides of a triangle are vectors A , B, and C, of length a, 6, and c, respectively, 
such that 

. 1 - B - C 


By using scalar multiplication, square both sides of this equation and, by interpreting 
the result geometrically, obtain the law of cosines. 

9. If .4 and B are nonzero vectors, show that the necessary and sufficient condition 
for A and B to be: 

(i) parallel is .1 ^ B = 0 


(ii) perpendicular is -4 • B =0 

10. Evaluate the following vector products: 

(i) (i2 +j2 - k)» (t +j) 

(ii) (iS -j + k)x(i2 -j + k2) 

(iii) («■ +j + k)* (i +j + k2) 

11. If .1 and B are sides of a triangle, show that the area of the triangle is given bv 

area = (.-1 » B) • (.4 x B) 


Calculate the area when 


A ==i+j + k 
B =i2 + k3 


12. Prove Lagrange’s identity 

(/IxS) - (C»Z)) 


(A-C) 

(BC) 


(A ■ D) 
(BD) 


Use this to find an alternative form for the area formula given in Prob. 11. 
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13. Using the vectors given in Prob. 10 (iii), compute P,, Fy, F-, and the projec- 
tions of Eq. 14, page 191. Find the angles By,, B„y Oxy, and in terms of 

these results verify Eqs. 16, 17, and 18, page 191. 

14. If .4, jB, C are coterminous edges of a tetrahedron, show that the volume of the 
tetrahedron is given by 

volume = ^1.4 ^B^C\ 

Calculate the volume when 

A - iS -j -Vk 
B =1 -fi-Fife 
C = i-k 


15. Show that 

(AyB)y(CyD) = (A^B-D)C - {A^B C)D 
= (A^C-D)B - (BxC*Z>M 

16. Prove the formula 

(.4 -B)- (B^C)^ (C^A) = (A^B- C)® 


17. If U* = X- -h 4y- -h 16c-, find grad U at the point P(2,2,l). WTiat is the shape 
of the constant value surface through P? Find the directional derivative of U, 

ds 

at P, if the direction of 5 at P is along the vector 

A = — » + j2 + k2 

18. Verify Eq. 48, page 198, if 

U = X- +yz 
V = xyz 


19. Find a fimction U {xj^) with gradient equal to 

grad U = ilx + jSy* + k4z^ 

20. In fluid dynamics, there arises the energy function 

= f ^ 

Jpo P 

where the density of the fluid p{Xjy^) is a function of the pressure p(x,y,z). Find the 
general expression for 

grad<^ 


If in particular 


^ = (** + y* + a*) - i 


and 


p ^ cp (c, a constant) 
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show that 


grad«= - + 


c (:c2 + >^2 ^ z^) 

21. ’Let U = + 16z“ define a potential. Calculate the line integral 

S grad U • ds 

from Pi(l,l,l) to P2(2,2,2): 


(i) By integrating along straight lines from (1,1,1) to (2,1,1) to (2,2,1) to (2,2,2). 

(ii) By integrating along the straight line PiP2- 


22. Calculate the divergence of the following vector fields at the points indicated. 
Which of the points represent sources? Sinks? 

(i) .1 = i{x- - y-) 4-7(2“ - X-) 4- - 2“) at (2,3,1) 

(ii) A = (ix +jy 4- kz)(x' 4- y^ 4- 2^) at (-1,2,2) 

(iii) .1 = {i3z -jSx 4- ^5y)(xy) at (1,0,0) 


23. By considering the defining equation 56, page 201, calculate div (ix -{-jy+ kz) 
at the origin, using a sphere of radius Ar, with center at the origin, as the infinitesimal 
volume. 

24. Determine the volume of fluid per second flowing out of a spherical region of 
radius 3 feet if the vector velocity field for the region is given by 

V - ix^ 4- jy^ 4- kz^ feet per second 


referred to a set of axes with origin at the center of the sphere. 

25. Calculate the volume of fluid per second flowing into a cube measuring two 
feet along each edge if the vector velocity field is given by 

F =* - 4- 7y “2 -f kz'x feet per second 

referred to a set of axes with origin at the center of the cube and such that the x-, y- 
and 2-axes are {perpendicular to a face of the cube. 

26. Show that the volume enclosed by a closed surface 5 is 

volume = ^ 



where r is the distance from the origin to the variable point; that is, 
r =: \/.v2 + y2 + 22 . 

27. Find the potential U(xj,z) associated with a vector force field which is 
directed toward the origin, with magnitude inversely proportional to the distance 
from the origin. (Hint: By symmetry, U will be a function only of r, the distance from 
the origin.) 

28. The velocity of a fluid is radially outward from a point source and is propor- 
tional to the distance from the source. Find the velocity potential associated with this 
vector velocity field. 

29. A sphere of radius a with center at the origin contains a space charge of density 

p - a7 — (where r - Vx* 4- y^ 4- 2 *) 

There are no charges exterior to the sphere so that the resulting field will possess 
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spherical symmetry, that is, 

A = /(r)r (where r = ix -f jy + kz) 

Using Eq. 64, page 205, on a sphere of radius r, evaluate /(r) and thus show that 

= — 3 r (when r > a) 


and 



(when r < a) 


30. In the preceding problem the potential U associated with the field A may be 
computed from 

t; = jf ^ . dr 

(This may be thought of as the work done by the field when a unit charge, upon which 
it acts, moves from r out to infinity.) Calculate the expression for U valid outside the 
sphere and show that U satisfies 

VW = 0 


Also find the potential U valid for points inside the sphere and show that here U 
satisfies 

V'U = ~47rp 

31. On the surface of a sphere of radius a and center at the origin is a surface 
charge of constant density cr. By the same procedure used in Probs. 29 and 30, de- 
termine the vector field A , and the scalar potential U, for points interior and exterior 
to the sphere. Show that Laplace’s equation is satisfied for all points except on the 
surface of the sphere, and that on the surface itself, Eq. 68, page 206, is satisfied. 

32. Calculate the curl of each of the following vector fields: 

(i) A = -iy -hjx 

(ii) A = i3xy +j2yx + kyz 

(iii) A = ixyz +j(x“ + + Z‘) A- k(x A- y +z) 


33. Verify Stokes’s theorem by computing separately 


A • ds 


and 


X 


(curl A ) ' 


da 


in the case where 


A - iy -- jx 


and the surface S is the hemisphere of radius 6, above the ary-plane, whose boimdary 
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L is the circle 

in the jcy-plane. 
34. Evaluate 


= 6 * 


§ 


A • ds 


around a square of side ft, which has a comer at the origin, one side on the x-axis, and 
one side on the 3 ^-axis, if 

A = ,-(x2 _ y2) A-j2xy 

35. Consider an infinitely long, straight vortex thread of constant moment /. By 
using Eqs. 102 and 103, page 213, together with the fact that the resulting field will 
possess cylindrical symmetry (that is, the lines of force will be circles about the vortex), 
deduce the expression for the vector field .1. Use cylindrical co-ordinates with the 
2 -axis along the vortex. 

Show that the potential which satisfies Eq. 130, page 220, is given by 

r - 2/(00 - 0 ) 

where 0 is the angle in cylindrical co-ordinates. Verify that this is a multivalued 
potential function which satisfies Eq. 136, page 222. 

36. Consider an infinitely long right circular cylinder of radius b whose axis is the 
2 -axis of cylindrical co-ordinates. Interior to the cylinder is a vortex density J defined 
as follows; the direction of J is always parallel to the 2 -axis and the magnitude is 
given by 

|y| = 

The exterior of the cylinder is assumed to be vortex-free. Using the same sort of 
reasoning as in the preceding problem, determine the vector field A, obtaining ex- 
pressions valid inside and outside the cylinder. For the inside of the cylinder, find the 
vector potential P which satisfies 

curl P = A 
div P = 0 


{Hint: From Eqs. 126 and 127, page 219, it can be seen that the direction of P is the 
same as that of 7.) 

For the outside of the cylinder, find the scalar potential U which satisfies 

grad U = —A 

37. A point P(x,y,z) moves along a space curve in such a way that its co-ordinates 
are given by the following functions of time: 

X = a cos / 

y = asm t 

z = bt 


Find the vector velocity and acceleration of the point P. 
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38. The potential energy of the region in the preceding problem is given by 


U = 


-r- — sin t 
x‘ 4- y- 


Find dU dt for the point P(.v,y,s) of Prob. 37, at time /. 

39. With reference to Probs. 37 and 38, let .1 = — grad U. Determine <7.1 dt for 
the same P(x,y,z) at time /. 

40. Let 


A = i(x- -f yc) +y(y‘ + -.v) + + .ry) 

B = i{xy) -\-jiyz) 4- k{zx) 

Calculate: 

(i) grad U 

(ii) div.l.divP 

(iii) curl .1 . curl J5 

(iv) div (T-l) 

(v) curl ir.l) 

(vi) div (.1 • B) 

(vii) curl (-1 ^ B) 

(viii) grad (.1 • B) 

41. If .4 is a constant vector, prove 

V(.4 . r) = ^ 


42. Let and V 2 be vectors from the fixed points (.vi,yi,Si), (x 2 ,y 2 ,Z 2 ) to the 
variable point (A:,y,z). Show that 

fi ) div (I'l ^ Vo) = 0 
fii) curl (Ti^ To) = 2(Fi — F 2 ) 

(iii) grad (Ti • To) = Fi 4- V 2 

43. If is an arbitrary vector field, evaluate 

X V) • P 

and 

{A ^V)-R 


44. Obtain Eq. 287, page 243, directly from 


r^.rr, x d^U 


by the substitution given by Eqs. 235 to 237, page 237. 

45. Obtain Eq. 288, page 243, directly as in Prob. 44 by means of the substitution 
given by Eqs. 245 to 247, page 238. 
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46. Suppose that 

yp = ypiai) 
Oil = \pi{a2) 
Of 2 = 


OCn - a„(u,v,w) 

a series of relations through which yp is e.xpressed as a function of w, v, w. Show that 

,, r*/ (^\p daida 2 dan-i „ 

grad p ^Vp = - - — Va, 

do[iaoc200i3 dan 

47. Apply the formula of the preceding problem to compute the gradient of 
P = lnsin/‘*''^<"*‘'^> 

in which 

A = iudi + IrU2 + iudz 


B = iuH + i,v + 


the quantities <72, (I 3 being constants. 

45. Consider a rectangular co-ordinate system with origin at the point 0 and let 
() and P be two other points separated from O by the distances p and R respectively. 
I-et r denote the vector distance from () to P (as defined in Art. 17) and similarly let 
p and R denote the vectors 0() and OP. The angle between the latter two vectors is p. 
If () is assumed fixed while P moves upon the surface of a sphere with radius R, 
compute the magnitude and the direction cosines of the vector gradp cos Show that 
this vector is perpendicular to R. 

46. With reference to the previous problem compute the magnitude and direction 
cosines of the vector graefi; cos p when P is fixed and () moves over the surface of a 
sphere of radius p. W hich vector, p or R, is perpendicular to grad<^ cos p? 

50. Upir) is some scalar function of the distance r as defined in Prob. 48, show that 


grad/»<^ 


dp 

^;grad,.r 


- grad<2r 


Compute the magnitude and the direction cosines of this vector if 


P = 


-jkr 


r 


(A’ being a constant) 


51. Referring to the statement of the previous problem, show that 

Vg ”<^ = V />‘“0 = — ]z^p 

52. If a is a constant vector and p{r) a scalar function of r, show that 

Vg X (ap) = a X Vpp 


and 


Vg X Vg (Bp) — Vg(a * VqP) — BV Qp 

S3. Prove the vector identity 


E ^-^[A^ExA + (£-.4M] 
.4 • A 


in which E and A are any vectors. 
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54. Prove the following vector relations: 

£ • (V '< V X a</>) = k‘<f)E • a + £ • V(a • V0) 

£• V(a-r</») = V- [(a- V<^)£] - (a- V<^)(V*£) 

£ • (V X V X a0) = a • {k^-<t>E - (V • £)V0! + • [(a • V0)£] 

in which £ is a vector function of .r, y, z, <5 is the scalar function defined in Prob. 50, 
and a is a constant vector. 

55. If A and B are any two vector functions of the space co-ordinates, show' by 
means of Gauss’s law that 



• V X V X ^1 — ,4 • V X V X J5) i/r 



V X 5 - B X V X ^1] . da 


in which da is a vector element of the surface 5 and dv is an element of the enclosed 
volume V, 



CHAPTER VI 


Functions of a Complex Variable 


It is assumed that the reader has some acquaintance with the subject 
of complex numbers and the representation of them in the complex plane 
(also known as the Gaussian plane). Here it is customary to consider the 
x- 2 Lxis as the '' axis of reals ” and the y-axis as the “ axis of imaginaries.” 
The symbol j = V— I prefixed to a real number signifies that the latter 
is the imaginary or y-component of a given complex number. The y-axis 
is, therefore, sometimes referred to as the “j-axis."’ 

A complex number s = .r -h jy is plotted in the complex plane as the 
point P(x,y), and z is interpreted geometrically as a vector* drawn from 
the origin to this point. This interpretive procedure is referred to as the 
rectangular representation of the complex number (or vector) z. Its 
polar representation is given by z = whence p = Vx^ + y^, and 6 = 
tan*”^ iy/x). The familiar law of parallelogram addition applies to the 
addition of any given set of complex numbers. The details of this process 
as well as those involved in the multiplication and di\'ision of complex 
numbers are not further elaborated upon here. 

The object of the discussion in these pages is rather to give the reader 
some acquaintance with a complex function whose independent variable 
is a complex number. The most obvious novelty exhibited by the function 
of a complex variable, contrasted with a real function of a real variable, 
lies in the fact that the values of both the function and the variable 
are no longer characterized by single numbers; two numbers are now 
required for the specification of each. Thus a value of the variable z in- 
volves the specification of the two quantities, x and y, and since the 
function of z is likewise complex, its values also involve the specification 
of a real and an imaginary part. 

In \iew of this situation it is clear that the process of graphical rep- 
resentation requires new methods in the case of complex functions. In 
addition, one should carefully re\iew the fundamental operations of 
the differential and integral calculus in order to see whether their familiar 
interpretation may be extended in some way to apply to complex func- 
tions. This extension should be made with the minimum possible change 
in basic conceptions in order that the many useful relations known with 

•Complex numbers are frequently referred to by electrical engineers as vectors. It has 
been pointed out that the term “ vectors ” is in this connection misused and that such misuse 
may lead one to draw false conclusions or otherwise fall into dangerous byways of thought. 
This view is not shared in this book, principally because the discussion given in Art. 5 shows 
that it has little justification. 


3S3 
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regard to functions of a real variable may become available also to the 
manipulation of the complex ones.* 

The most useful operations familiar to the reader in connection with 
functions of a real variable are those of differentiation, integration, and 
expansion in series. A discussion of the extension of these conceptions to 
functions of a complex variable is the principal i^m of the following 
articles. 

In these discussions no particular attempt is made to. give rigorous 
proofs. The derivations are given entirely for the purpose of providing 
the reader with a partial insight into the relevant fundamental inter- 
relationships. Wlierever possible, an attempt is made to establish contact 
between the new conceptions and those with which the student has an 
acquaintance of longer standing. To the engineer who is tiydng to gain a 
working knowledge of function theory, rigorous proofs arc a waste of 
effort, but plausibility arguments do serve a useful purpose in that they 
provide the necessary circumspection for facile and intelligent use of this 
important mathematical tool, and at the same time lay the groundwork 
for a more thorough study which may seem desirable at a later time. 

1 . Differentl4tion 


The complex function of a complex variable z is denoted by 



'u> = fiz) 

[ 1 ] 

with 


z = X +jy 

[ 2 ] 

and 


U' = w + jv 

[ 3 ] 

Therefore 

u and V are functions of both x and y; thus 



u = u{x,y), V = vix,y) 

[ 4 ] 


It should be observed that both the real and imaginary parts u and v 
are real functions of the two real independent variables x and y, which 
are the real and imaginary parts of the complex variable s. The language 
used here may be somewhat confusing to the reader, inasmuch as v and y 

•This approach to the consideration of the theory of functions of a complex variable may 
appear to some readers to be somewhat strange. It should be remembered, however, that 
these pages are addressed primarily to the engineering student whose previous experience 
with mathematics has been confined almost wholly to real functions of a real variable. To him 
the process of regarding the present discussions as an extension of some of the manipulations 
which apply to functions of a real variable not only appears to be sensible but also is the 
course which his process of learning will take in any case. 
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are spoken of as the imaginary ” parts of w and z respectively, whereas 
at the same time they are pointed out as being real quantities. The 
strangeness of this method of expression should, however, readily be 
overcome by concentration upon its mathematical significance as ex- 
pressed by the relations 1 to 4. 

It is now of interest to examine whether the derivative of the function 
w with respect to the variable z may be interpreted as the limit of the 
ratio Iz, in which and Iz represent corresponding increments, as 
the increment Ac approaches zero. Reflecting upon this situation at once 
discloses an apparent difficulty, since one is reminded of the fact that 

Ac = Ax -h j Ay [5] 

and hence that Ac may be interpreted in an infinity of ways. If one 
assumes for the moment that the increment Ac has a fixed magnitude, 
its direction in the complex plane may be varied in an infinite number of 
ways, thus yielding an infinite number of corresponding increments 
in the function. It does not necessarily follow, of course, that the ratio 
Alt’' Ac corres{X)ndingly assumes an infinite number of values, but unless, 
in the limit Ac 0, this ratio is independent of the direction of Ac, the 
derivative of the function %ji: evidently does not possess a unique value. 

Whereas it is conceivable that the extension of the usual conception of a 
derivative to functions of a complex variable may require a distinction 
with regard to the direction assigned to the increment Ac in the complex 
plane, the simplicity and general usefulness of this derivative would 
unquestionably be greatly impaired if its value were subjected to such a 
restriction. The undesirability of the latter suggests that one ought to 
demand that the derivative operation be completely free from this 
rq^striction, and then inejuire into the bonds which are thereby laid upon 
the nature of the complex function. If these are not so severe as to rule out 
of consideration such classes or kinds of functions as one would like to see 
embraced by the theory^ which is the object of this discussion, one may 
still be served by pursuing it. 

As will be seen shortly, it turns out that this point of view may well be 
taken and, surprisingly enough, that more functions out of a set written 
down at random fall into a classification which meets these bonds than 
one might at the outset expect. In fact, the results are so gratifying in this 
respect that one is justified in ruling out of further consideration all 
complex functions which do not comply with this demand and in stipulat- 
ing that the term “ function of a complex variable ” shall apply only to 
those that do. 

The conditions under which the derivative of the function w with 
respect to the complex variable z may be independent of the direction in 
which the increment As is taken are readily found by first indicating the 
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derivative by means of the relations 2 and 3 as 

dw dii + j dv 


Now from Eq. 4 


and 


dz dx + j dy 


- , du , 

dll = — dx H dv 

dx dy ' 


, dv , dv . 

dv — — dx H dv 

dx dv ' 


Substituting into Eq. 6 and writing the result in the form 
/ dll . di\ / dll . dy 
dw \d.r ^ dx) \6y dy) dx 


dz 


1 , 


[ 6 ] 

[7] 

[ 8 ] 

[9] 


show that the direction of the differential increment dz is determined by 
dy/dx, and hence, if the expression 9 is to be independent of this direction, 
that the necessary conditions are expressed by 


dll . dv 

dll . dv 


1 

J 


[ 10 ] 


or 


du . 

dy~^'^dy dx dx 


[ 11 ] 


Equating real and imaginary parts in this equation gives the conditions 

du dv 
dx dy 


[ 12 ] 


and 


du dv 

dy dx 


[13] 


These are the conditions which the real and imaginary parts of the com- 
plex function w = f(z) must fulfill in order that its derivative may have 
a unique value for any point s, regardless of the direction of the increment 
dz at this point. Equations 12 and 13 are known as the Cauchy-Riemann 
partial differential equations {or condition equations). Only those functions 
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w ^ u -{■ jv which satisfy these equations are henceforth to be called 
functions of a complex variable. 

Practically all the common functions familiar to the reader are found 
to satisfy the conditions 12 and 13. The simplest of these functions is 


w = z 

[14] 

for which 


u — X and v = y 

[15] 

is obviously a function of a complex variable. 

\ constant times an integer p)ower of z, namely 


w = as" 

[16] 


is likewise seen to satisfy Eqs. 12 and 13. Hence it follows that any 
polynomial 


■w = a„s" + a„_,a" * 


• • + aiS + tto 

[17] 

or any (luotient of polynomials 




a„a" + 

+ • ■ 

• • + flis + ao 

[18] 


+ • ■ 

• -f bxz -h 


are also functions of a complex variable. The familiar trigonometric, 
hyperbolic, and e.xponential functions as well as the logarithm, w’hen 
regarded as functions of all satisfy Eqs. 12 and 13. Fractional powers of 
z and fractional powers of polynomials in z satisfy the conditions. It is, in 
fact, more ditlicult to find functions that do not satisfy the conditions 12 
and 13 than it is to find those that do. A few exceptions are 

= H = [19] 

and 

w — z — X — jy [20] 

but even these simple exceptions are rather peculiar and hardly worth 
bothering with anyway. 

It is appropriate to point out that the fulfillment of the Cauchy-Rie- 
mann equations does not suffice for the existence of the derivative. The 
latter requires that the partial derivatives 12 and 13 be continuous func- 
tions of X and y in the vicinity of the point in question. 

A point at which the function is not differentiable is called a singularity. 
If the function is differentiable everywhere within an arbitrarily small 
region in the vicinity of some point, it is there said to be regular or 
analytic (the term Iwlomorphic is also used to describe this property). A 
region throughout which a function is analytic is spoken of as a region of 
andyticity. 
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2 . A GiLVPHicAL representation; conformal mapping 

Since the complex function u* as well as the complex variable z require 
two quantities (their real and imaginary parts) for their description, the 
graphical representation used for functions of a real variable is not 
available for the plotting of functions of a comi)lex variable. Instead, the 
values of the variable c are plotted in one complex plane (the jc,y-plane or 
>plane), and the corresponding values of the function iv arc plotted in 
another complex plane (the /Gr-plane or ^c’-plane). A given ix)int in the 
>plane represents a complex value hw the independent variable z which 
determines a value for the function and this value in turn determines a 
corresponding point in the tc'-planc. A continuous curve in the c-plane may 
be thought of as defining a set of jx)ints in this plane, and if the function 
li' = f{z) is continuous throughout this range of ::-values, a corresix)nding 
continuous curve is thereby determined in the 7 v’-planc. 

The construction of a family of curves throughout a given region in 
the ::-plane makes it possible to map the behavior of the function over a 
corresponding region in the xe-plane. For this puqx)se the cur\'es drawn 
in the ii-plane may, for example, be the sets of straight lines defined by 
X = constant and y = constant, the constants being chosen so that these 
orthogonal families of lines form a unifonn grid. Alternatively, a set of 
circles concentric with the origin and the orthogonal set of radial lines 
may be drawn in the c-plane, or one may select any other sets of curves 
which appear best to serve the puq>ose in view of the particular function 
under consideration. 

It is useful to note a very interesting property of functions of a complex 
variable which is made evident by such a graphical representation. This 
property follows directly from the fact that the derivative of the function 
is independent of the direction of the vector increment dz in the c-plane. 
Assuming for the moment that the increments are finite, the derivative 
is approximated by Xw/ Xz, If the increment A:; is thought of as having a 
fixed magnitude but any desired angle, the fact that the quotient As 
has the same complex value regardless of the angle of As means that 
variations in the angle of Xw are exactly equal to any assumed variations 
in the angle of As. In other words, since the angle of the complex quotient 
^w/ As is independent of the angle of As, the changes in the angles of As 
and Aw must always be equal. 

Any two increments Ajs and A22 differing only in direction at a given 
point in the s-plane may be looked upon as a pair of path increments 
along any two curves which intersect at that point; and, similarly, the 
two corresponding increments Aiw and A2W (whose magnitudes are alike 
because Aiw/Aiz = A2WI A^z) may be looked upon as a pair of path 
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increments alon^( two curves intersecting^ at the corresponding point in 
the it'-plane. Since the angle between the path increments N\Z and A 22 
equals the angle between the corresponding path increments AiW and 
iiny sets of curves drawn in the c-plane intersect at the same angles 
as the corresponding curves in the i^’-plane at all corresponding points. 
The process of mapping cur\'es in the xc^-plane corresponding to any 
chosen curves in the c-plane, in other words, preserves the angular rela- 
tionships between these curves at all corresponding points. For example, 
if two sets of cur\'es drawn in the s-plane form orthogonal families, the 
corresix)nding sets of cur\x's which arc mapped in the tv'-plane by means 
of any function ic — f(z) (satisfying the conditions 12 and 13, of course) 
likewise form orthogonal families. 

It should be observed that if the angle of the increment A 2 Z is larger 
(or smaller) than the angle of Aiz, the angle of ^ 21 ^ is likewise larger 
(respectively smaller) than that of Aixc. In other words, the angular 
increment between Aitc and Aotc is equal to the angular increment between 
AiC and Aoc not only in magnitude but also in sense. That is, if the rotation 
from Aiz to AoC in the c-plane is, for example, counterclockwise, the rota- 
tion from Aiu' to Ao^c in the 7c-planc is also counterclockwise. 

If u' = f(z) is a function satisfying the conditions 12 and 13, the func- 
tion ic = /(c), in which the bar indicates the conjugate value, evidently 
does not satisfy these conditions. Since, for a given Ac, the increments 
Alt’ and Ai? have the same magnitudes but opposite angles, it is clear that 
the function v = /(c), in its mapping properties, preserves the angular 
relationships in magnitude but reverses their sense (as is the case with a 
picture and its mirror image). The mapping properties of both the func- 
tions ic = /(c) and = /(c) are said to be isogonal (meaning that the 
iXiagnitudes of angular relationships are preserved). In addition, the 
mapping property of the function tc = f{z), which preserves the sense as 
well as the magnitude of angular relations! .ips, is described as being 
conformal. 

As a consequence of the property of conformality, one may see that if a 
small region of a map in the c-plane (with numerous intersecting curves) 
is compared with the corresponding small region in the it?-plane (with 
numerous corresix)nding intersecting curves), these two small mapped 
regions are found to be exact replicas of each other except for a factor of 
magnification (or diminution) equal to the magnitude of Aw/ Az at the 
point where this region is located, and a rotation through the angle of 
Aw/Az. This observation is strictly accurate, of course, only in the limit 
as the size of the entire region tends to zero, but for small regions of finite 
size, the two corresponding maps are very nearly alike in detailed form. 
The term conformal " is thus seen to assume a clearer significance. 
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3. The inverse function 


The corresponding maps in the u*- and :;-planes for a given function 
= f[z) place in evidence a mutual relationship between the two com- 
plex quantities it* and z in the sense that either one may apparently be 
looked upon as the independent variable. In other words, the given 
function 

■w =/(=) = «(.v,v) +X.v,>>) [21] 

may presumably be inverted to yield 

2 = <^>( 1 ;') = +jy{ti.v) [22] 

at least over regions throughout which a one-to-one relationship exists 
between U' and This thought may be investigated further through the 
consideration of the relations 

- dx , Ox _ 

dx = — du H dv [23] 

Oh dv 


, dv , dv , 
ay = ~ d// + — dv 
du dv 


[24] 


which are the inverse of Eqs. 7 and 8. 

Denoting the determinant of Eqs. 7 and 8 by D, and noting Eqs. 12 
and 13, one sees that 


du dv du dv _ 

dx dy dy dx \dx/ \dy/ \dx/ ^ \dy/ 


[25] 


But, again with the help of Eqs. 12 and 13, one has* 


diV 

dz 


= /'W 


so that Eq. 25 yields 


dll , . . du _ dv ,dv 

dx ^ dx dx dy dy^^ dx 

D = \f'(z)\^ 


[26] 

[27] 


Now Eqs. 7 and 8, on the one hand, and the pair of inverse Eqs. 23 
and 24, on the other, must have inverse matrices; that is, 


dx 

dx 


du 

du 

du 

dv 


dx 

dy 

dy 

d_y 


dv 

dv 

du 

dV^ 


dx 

9y. 


[28] 


*The correctness of these relations should be clear from the fact that the value of the 
derivative Ls independent of the angle of the increment dz = dx j dy. If this angle is zero 
then dy = 0, which means that u and v are differentiated partially with respect to x only. 
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from which it follows that 


dx 

1 dv 

dx _ 

1 du 

[29] 

du 

D dy 

dv 



1 dv 

^ = 

1 du 

[30] 

du 

D dx 

dv 

Ddx 


Hence 


dx dy 
du dv 


[31] 


dy 

dv du 


[32] 


which are the Cauchy-Riemann equations pertaining to the inverse 
function 22. 

With the help of these various relations, one may now write 

dv . dv 

_ 1 

D dv . dv 

1 ^ 

^ /'(s) \dz) [33] 

dx dx 


dz _ dx , dy 
dw du~^'^ du 


According to Eqs. 31 and 32, the inverse function 22 is also a function 
of a complex variable, and Eq. 33 shows that Jie derivative of the inverse 
function is the reciprocal of that for the given function at corresponding 
values of iv and z. In other words, the conformal maps for the function 
= f(z) may likewise be regarded as the maps for the inverse function 

A precautionary remark may be made here about difficulties of inter- 
pretation in dealing with multivalued functions. Although these matters 
are discussed in greater detail in subsequent articles it should be observed 
now that, in view of Eq. 33, the derivative of the inverse function e\i- 
dently does not exist at a point where that of the given function w = f(z) 
becomes zero. In the immediate vicinity of such points, the maps in the 
tc’- and s-plancs for a given function are still uniquely related, although the 
preservation of angular relationships no longer holds (as is further dis- 
cussed in Art. 14). 
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4. The :;-plaxe and its associated complex sphere; the point 

AT IXTINITY 

At various times it is expedient to consider the value or behavior of a 
function at infinity, that is, in the limit c — > ». Since any point in the 
complex ::*plane which is infinitely remote from the origin is a {xiint at 
infinity, it may seem as though infinity should be regarded as a vast 
region embracing an infinity of {wints. While admissible on purely logical 
grounds, this view is extremely awkward from a mathematical standix)int, 
since the behavior of a function “at infinity^’ would embrace its behavior 
at an infinite number of points. 

The difficulty involved here is readily overcome, however, through 
introducing (by definition) a slightly altered conception of the complex 
plane. Thus it is perfectly admissible to think of this plane as being the 
surface of a sphere of infinite radius, or, for the sake of easing the mental 
strain produced by this conception, as a sphere with so large a radius that 
any finite region that may be considered appears for all practical i)uqx)ses 
to be flat. If the origin of this “ plane " is taken to be the south pole of 
the sphere, all {X)ints infinitely remote from the origin coincide at the north 
pole. Infinity is then no longer a vast region but becomes a single |)oint. 
It is called Ihe point at infinity. 

In order to overcome the necessity of thinking of the >plane as an 
enormous spherical surface, another artifice may be utilized which in some 
respects has certain advantages over the infinite .sphere idea. Thes-plane is 
visualized as a truly flat surface, with a sphere of arbitrary but finite 
radius resting upon its origin. The point of tangency between the sphere 
and the s-plane at its origin may be taken as the south fxile of the sphere. 
The corresponding north px)le is then perpendicularly above the origin. 

A given finite point Zq in the i:-plane is now thought of as joined with the 
north pole of the sphere by a straight line, which intersects the surface of 
the sphere at one point other than the north jxile. This geometrical con- 
struction, which is called slereographic projection, associates a {loint on 
the sphere with every point in the complex c>plane in a manner unique 
for all points except those infinitely remote from the origin. All these 
correspond to the north pole of the sphere. Thus, with the help of stereo- 
graphic projection, infinity is again interpreted as a single point. 

In all considerations regarding regions and paths in the complex plane, 
the corresponding ones on the surface of the sphere may be substituted. 
It can be shown geometrically that the process of stercographic pro- 
jection preserves the magnitudes of angular relationships between inter- 
secting curves in the plane and the corresponding ones on the spherical 
surface (the process is isogonal). Circles in the plane are circles on the 
sphere. In particular, any circle on the sphere which passes through the 
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north pole becomes a straight line (circle with infinite radius) in the 
plane. A great circle through the north pole (meridian) corresponds to a 
straight line drawn through the origin (radius vector) in the 2 -plane. 

Such a sphere is referred to as the complex sphere associated with the 
2 -planc. A similar sphere may evidently be associated with the 2£;-plane 
in connection with the mapping of any function iv = fiz). Because of 
the isogonality of the process of stereographic projection, it follows that 
corresponding maps on the two spheres for a given function are conformal, 
so that the spherical surfaces may in all instances be used to replace the 
complex t4'- and 2 -planes. In this way the process of conformal mapping 
may readily be visualized over regions which include the point at infinity. 

5. Alternative graphical and physical interpretations 

A numlier of additional interesting properties of functions of a com- 
plex variable may lie studied through identifying the complex plane with 
a cross-sectional plane associated with a physical system having longi- 
tudinal uniformity. This direction coincides with what is ordinarily 
designated as the 2 -axis of a rectangular co-ordinate system. A static 
field (electric, magnetic, or hydrodynamic) associated with such a 
supposed physical system has either a zero or a constant component in 
the longitudinal direction. This component is ignored. In the following 
discussion, and w’herever the physical argument requires three-dimen- 
sional or space consideration, it is understood that a unit length in the 
longitudinal direction is implied. In other w^ords, the field is regarded as 
a two-dimensional one, since its behavior is of interest only in the cross- 
sectional plane which is identified with the complex .v,y-plane. 

In such a system, a vector function A (.v,y) is assumed to represent 
the flow of some physical or fictitious fluid. Ir complex form 

A = A-jA; [34] 

With reference tcft'ig. 1, S represents a closed (mathematical) bound- 
ary. If a differential length of this boundary is denoted by dsy the net flow' 
outward through the boundary' is given by the integral 

A nr/5 [35] 

in which A n is the normal component (directed outw'ard) of A. According 
to the geometry show'n in the figure, and on the assumption that the 
integration is extended around the closed contour in the counterclock- 
wise direction, this integral may be written 

An r/5 = (Aj dy - Ay dx) 


[36] 
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From Gauss’s law, it is recog- 
nized that this net flow outward may 
alternatively be calculated by integra- 
ting the divergence of A over the 
surface* enclosed by the boundary 5, 
that is 

jT An ds = y'div A da [37] 

enclosed surface 


Fig. I. Relevant to the integral of the 
normal component of a vector function 
around a closed contour. 

and hence Eqs. 36 and 37 yield 


j . A ^A j dA 

div A = H 

dx dy 


£ ul. dy - A, dx) = da 


enclosed surface 


If Ax and Ay are now identified respectively with the imaginary and 
the real parts of a function of a complex variable w = /(z), thenf 

Ax = iix,y) [ 40 ] 

Ay = ii{xj) [ 41 ] 

and the Cauchy-Riemann Eq. 13 shows that 


div A = 0 


and hence that 


^ {Ax dy — Ay dx) = ^ {vdy — u dx) = 0 


In order that the divergence of A with components defined by Eqs. 
40 and 41 shall be zero throughout the surface enclosed by the boundary 
5, it is necessary, of course, that the Cauchy-Riemann equations hold 
throughout this region. This condition requires that the function w = 
f{z) be regular throughout the region; otherwise its derivative does not 
exist at all points over which the surface integral in Eq. 37 or Eq. 39 
extends. 

Physically, Eq. 43 means that the field A is source-free throughout 


*The reader may again be reminded that a unit of length in the longitudinal direction is 
implied so that this surface integration is actually equivalent to a volume integration, 

fit should be observed that w = Ay -j-jAx and hence that w = f(z) should not be con- 
fused with the vector function ^4, Eq. 34. 
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the enclosed region, as evidenced by the vanishing of the divergence of A 
for all points within the region. Hence it may be said that the imaginary 
and real parts respectively of a function of a complex variable which is 
regular throughout a given region in a z-plane may there be regarded 
as the X- and y-components of a source-free field. 

If now the line integral of the vector function A is formed for the 
closed boundary shown in Fig. 1, with a counterclockwise direction of 
traversal, one has 

.4 • tfs = jT (Ax dx + Ay dy) [44] 


According to Stokes’s law, it is recalled that 


i 


A (15 = 


J' (curl A) ■ da 

enclosed surface 


[45] 


The curl of A, which is directed normal to the x*,y-plane, is given by 
what would normally be regarded as the z-component, that is. 


curU.l = — " 
dx 


dAx 

17 


[46] 


Equations 44 and 45, therefore, yield 

f [47] 

enclosed surface 


Again identifying Ax and Ay with v and respectively, according to 
Eqs. 40 and 41, and making use of the Cauchy-Riemann Eq. 12, one 
finds that 

curl ^ = 0 [48] 

If the function w = f(z) is regular throughout the region enclosed by the 
curve Sf Eq. 48 holds for all points within this region, so that 

^ (Ax dx + Ay dy) = ^(vdx + u dy) = 0 [49] 


Hence it may be said that the imaginary and real parts respectively of a 
function of a complex variable which is regular throughout a given region 
in the z-plane may there be regarded as the x- and y-components of a 
field which is not only source-free but also non turbulent. If the enclosed 
region contains points at which the derivative of the function w = f(z) 
does not exist, the relations 43 and 49 no longer hold. In view of the 
present discussion such singular points may be regarded as either sources 
or vortexes in which the origin of the field A, and hence that of the 
function w = /(z), resides. This interpretation makes it clear that unless 
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the function 'w has singular points somewhere in the s-plane, it must 
reduce to a constant or to zero. In other words, the singularities of a 
function are its life-giving ’’ elements, and out of their nature and 
distribution alone does a function derive its individual properties and 
characteristics. This view leads to a useful method of classifying functions 
purely in terms of the nature and distribution of their singularities, which 
is briefly discussed later on. 

The Cauchy-Riemann Eejs. 12 and 13 provide a further physical 
interpretation for the real and imaginary parts of a function of a complex 
variable. Thus if Eq. 12 is dilTerentiated partially with respect to x and 
Eq. 13 with respect to y, the subsequent addition of the two equations 
yields 


dx~ df 


0 


[50] 


On the other hand, if Eq. 12 is differentiated partially with re.spect to 
y and Eq. 13 with respect to .v, the subsequent subtraction of the two 
equations gives 


dx^ dy^ 


= 0 


[51] 


These results are recognized to have the form of Laplace s equation for 
the potential of a two-dimensional source-free held. Hence the real and 
imaginary parts of a function of a complex variable may be interf)reted 
as scalar potential functions. As such they may be assumed to determine 
a pair of nonturbulent vector field functions. If the.se are denoted by A 
and B respectively, one may write 


and 


dll 
— > 

dx 

. dll 

[52] 

dv 

to' 

dy 

[53] 


Because of the Cauchy-Riemann equations, these field components are 
related as expressed by 

A, = By, Ay = [54] 

and hence the scalar product of A and B vanishes; thus 

A • B = AxBx + AyBy = —AxAy + AxAy = 0 [55] 


In other words, the two nonturbulent fields defined by the jx)tential 
functions u and v are orthogonal to each other. 



Art. 


INTEGRATION; THE CAUCHY INTEGRAL LAW 


267 


Xow it is further recalled (from the study of vector analysis) that the 
system of equipotential lines defined by the equations u = constant, and 
the flow lines for field A form orthogonal families of curves. The same is 
true of the equipotential lines defined by the equations v = constant and 
the flow lines for field H. Since the flow lines for field B are orthogonal to 
those for field yl, it follows, therefore, that the equipotential lines defined 
by u = constant are orthogonal to those defined by v = constant. 
Hence the latter coincide with the flow lines for field yl, and the former 
coincide with the flow lines for field B. This situation forms an alternative 
basis for the graphical representation of a function of a complex variable 
and for its physical interpretation. 

'I'hus, instead of using the conformal maps in the and s-planes, one 
may study a given function of a complex variable graphically by plotting, 
in the ::-plane alone, the systems of mutually orthogonal curvxs defined 
by the equations u ~ constant and v = constant. Throughout regions in 
which the given function n* — f(z) is regular, these have the character of 
the er|uiix)tential lines and flow lines of a source-free, nonturbulent field. 
Singular |)oints again have the character of sources or vortexes, the nature 
and distribution of which determine the properties of the given function 
Some aspects of these physical interi)retations are discussed further in 
Art. 22. 

6. Integration; tiie Cauciiy integral law 

.'\ certain orientation with regard to the question of differentiation 
having been gained, attention may now be directed toward the interpreta- 
tion of the integral (fl' a function of a complex variable. Here the two- 
dimensional character of the independent variable again injects some 
novel considerations at the outset. Thus if the integral 

Cmdz [56] 

•/n 

is formally regarded as representing the integral of a given function 
= f(^) between two particular values Zi and zo of the independent 
variable c, the question of the choice to be made for the continuous 
sequence of values of z as it proceeds from the point Zi to the point Z 2 
immediately arises. Such a continuous sequence of values evidently 
defines some path or curv'e joining the pxiints and Z 2 in the complex 
s-plane. Since any number of paths may evidently be chosen in the 
detailed process of evaluating the integral 56, the possibility exists that 
the value of this integral may not be unique. 

This question is similar to that arising in connection with the discussion 
of the differentiation of complex functions, and it is again due to the two- 
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dimensional character of the independent variable. Again it is felt to be 
highly desirable that the value of the integral vS6 should be unique, and 
if possible, that the Cauchy-Riemann equations, which insure the 
uniqueness of the derivative, should be suflicient to insure also the unique- 
ness of the integral vS6 so that its value 
may be independent of the path of in- 
tegration without need for the im}K)si- 
tion of further conditions ujx)!! the 
function iv. 

In order to investigate this ques- 
tion it is expedient to consider the 
integral formed for a closed contour 
in the ::-plane, as shown in Fig. 2. 
This so-called contour integral is writ- 
ten 



Fig. 2. Region of analyticity in 
the discussion of Cauchy’s integral 
law. 


f{z) dz 


[57] 


Here the contour C is assumed to be traversed in the counterclockwise 
direction, whence the enclosed region G is observed to lie on the left. 
Within the region G the function xc* = f{z) is assumed to be regular. For 
all points within the region G, therefore, the function is differentiable, 
and the Cauchy-Riemann equations are fulfilled. 

By substitution from Eqs. 2 and 3, the integral 57 becomes 

^ f(z)dz = (« +Jv)(dx -{-jdy) 


^ (u dx — V dy) -f j 


(? dx -(- u dy) [58] 


The closed contour C in Fig. 2 and the conventions regarding the direc- 
tion of traversal are essentially the same as those shown in Fig. 1 for 
the closed boundary 5; and since the function w — f(z) is assumed to be 
differentiable at all points within the enclosed region, the results ex- 
pressed by Eqs. 43 and 49 apply. Hence the important result follows that 

£f{z)dz = Q [59] 

Since the points Zi and Z 2 appearing in the integral 56 may be thought 
of as any two points on the contour C, as indicated in Fig. 2, it follows 
from Eq. 59 that the integral 56 is independent of the path of integration 
or that a given path joining Zi with 22 niay be changed at will without 
altering the value of the integral 56 so long as the path is not moved 
across a point at which the function w is singular. The latter restriction is 
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readily appreciated through noting that if the two portions of the contour 
C joining Zi and Z 2 in Fig. 2 are regarded as two variations of a given path 
between these points, the statement that the enclosed region G contains 
no singularities is seen to be equivalent to stating that none are en- 
countered in the process of sweeping one of these paths across the region 
G into coincidence with the other. 

The result 59 is known as the Cauchy integral law, which states in 
elTcct that the integral of a function of a complex variable between two 
given points in the complex s-plane has a unique value (subject, of 
course, to the restriction that any two chosen paths enclose a region in 
which the function is regular). The nec- 
essary conditions to insure this result are 
expressed by the Cauchy-Riemann equa- 
tions which at the same time insure the 
uniqueness of the derivative of a given 
complex function. 

'Fhis result is so important for all the 
subsequent discussions that it is well to 
consider in another way the relations 
leading to it. Figure 3 shows a differen- 
tial rectangle in the s-plane with its 
center located at some point z. The mid- 
jx)ints of the sides of this rectangle are 
denoted by a, b, c, d. A given function 
has the value lo = f{z) at the point s and is there assumed to be regular. 
Except for differentials of higher order, its values at the points a, h, c, d 
arc given by 


d 

^dx- 


'T 

Jdy 


Fig. 3. 
path 


An elementary closed 
the consideration of 


Cauchy’s integral law. 



[60] 


[61] 

1 

II 

[62] 

f{d) = a’ — TT - 

Jdy 2 

[63] 


Now since the function w satisfies the Cauchy-Riemann condition 
equations, its derivative is independent of the direction of the increment 
in the s-plane, so that 

dx j dy dz 


[ 64 ] 
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and hence 

f, . ,djdx 

f{a) ==w + -- — 

[65] 



[66] 


II 

1 

[67] 


fid) = U. 

[68] 


The integral of the function iv formed (in the counterclockwise direction) 
for the differential rectangle is given by the summation 

^ fi^) = J 1 / - /( dx — dy fid) dx [ 69 1 



Fig. 4. Approximation of a contour by a rectangular step curve. 

Substituting from Eqs. 65 to 68 shows that 

f /(s) dz = 0 [70] 

As illustrated crudely in Fig. 4, a given c losed contour C of finite size 
may be thought of as approximated by a rectangular step curve. 'Fhc 
approximation becomes better and better as the size of the steps is made 
smaller and smaller. The integral around the closed contour C may be 
replaced by the sum of integrals around all the enclosed small rectangles 
(all taken in the counterclockwise direction) because the contributions 
from the sides of adjacent internal rectangles cancel, just as in the 
argument leading to the result known as Stokes's law in vector analysis. 
If the function w = f(z) is differentiable at all points within the region 
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G, the result 70 is applicable to all the enclosed small rectangles, and thus 
the result 59 is again established. The Cauchy-Riemann equations to- 
gether with the condition that w be differentiable at all points within the 



Fig. 5. simply connected region. 



Fig. 6. A multiply connected region. 


enclosed region are again found to be necessary and sufficient for the 
validity of F^q. 59. 

The following remarks about the characteristics of the region enclosed 
by the contour C are necessary. This contour may conceivably have a 
form like that shown in Fig. 5, for which the enclosed region G is the 
shaded area. If the portions of the contour C leading to and from the 
smaller islandlike regions Ci, G 2 , and G 3 within G are moved closer and 
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closer together until they finally become superimposed, the contour 
integral around C evidently becomes equivalent to the sum of four 
separate integrals evaluated respectively for the contours C', Ci, C2, and 
C3 with directions of traversal as indicated in Fig. 6. 

So long as u' = /(z) is regular within the shaded region G, the integral 
law 59 still holds when evaluated for the contour C of Fig. 5 or for the 
equivalent set of contours C\ Ci, Co, and C3 shown in Fig. 6. It may no 
longer hold, however, if applied only to the contour C' of Fig. 6. Actually 
the enclosed regions Gi, G2, and G3 may contain ix)ints at which the 
function n* = f{z) is not differentiable and therefore, the integrals per- 
taining to the separate contours C'l, C’2, and C 3 arc not necessarily zero. 

A region which has embedded in it one or more subregions Gi, G2, • • • 
like the region G of Fig. 6 is said to be multiply connected. It is doubly 
connected if it contains one subregion, triply connected if it contains two 
subregions, etc. Unless the contour C in the integral 59 is interj)reted in 
the manner illustrated in Figs. 5 and 6, the validity of Cauchy’s integral 
law evidently requires that the region enclosed by the contour be simply 
connected. 


7. Cauchy’s integral formula 


A given function is assumed to be regular within the region enclosed 
by the contour S shown in Fig. 7. It is also assumed to be regular at all 

points on the boundary 5. These 
points are denoted by f, and the 
corresponding values of the func- 
tion are expressed by 

u'=/(r) [71] 

A contour integral is now con- 
sidered which has the form 



m 

's (f - s) 


[72] 


Here z denotes some point within 
Fig. 7. The region of analytic! ty in the the enclosed region. If f is for 
derivation of Cauchy's integral formula is moment regarded as capable 
the boundary 5 and_the region enclosed assuming any value within the 

enclosed region as well as those on 
its boundary, it is observed that the integrand as a function ^f f is reg- 
ular at all such points, with the exception of the point f = 2. At this 
point the integrand becomes infinite, and therefore Cauchy’s integral 
law no longer applies; the value of the integral 72 is then not neces- 
sarily zero. 
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However, if the point 2 is surrounded by a circular contour C, and the 
integration is extended over a closed path which consists of the curve S 
and the circle C joined by a line which is traversed in both directions, as 
shown in Fig. 8, the result is evidently zero because the corresponding 
enclosed region is the shaded area where the integrand is regular. It 
follows, therefore, that the value of the integral 72 is equal to that of 


Jc (f - z) 


[73] 


y 



in which the circular contour is traversed in the counterclockwise 
direction, and f now refers to points on this circular contour. The correct- 
ness of this statement is recognized from the fact that the difference 
between the integrals 72 and 7v^ is the integral around the composite 
boundary of Fig. 8, which has the value zero. 

The radius p of the circle C about z is now to V)e thought of as being very 
small, so small in fact that throughout the process of carrying out the 
integration of 73 the value of /(s') ditYers from that of f{z)hy a negligible 
amount. In view of the fact that /(f) is a continuous function within the 
region bounded by 5, this is a permissible assumption. (It would obviously 
not be permissible if /(f) were discontinuous at the point z.) Now if /(f) 
in the integral 73 is replaced by f(z), it may be taken outside the integral 
sign, because the integration is evaluated with respect to the variable f. 
Thus one has 


Js - z) 


<fMAL 

Jc ({• - s) 



[74] 


With reference to Fig. 7, the polar form of (f — s) is 

(f - z) = 


[ 75 ] 
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and since f is constrained to lie on the circle C, 



di = jpe’* de = y(f - z)de 

[76] 

so that 

cr -»)■''*’ 

[77] 

Hence 


[78] 

and consequently Eq. 74 yields 




[79] 

or 


[80] 


This result is known as Cauchy's integral formula. It enables one to cal- 
culate the value of a function of a complex variable at a point within a 
region in terms of its values on the boundary, provided the function is 
known to be analytic throughout the region inclusive of the boundary. 


8. The existenxe of derivati\’es of axy order 


By means of Cauchy’s integral formula, Eq. 80, it is possible to show 
that a function of a complex variable possesses derivatives of any order 
at a point w'here the function is regular. The familiar formula for the 
derivative of a function w = f{z) reads 

dw . 

— = limit 
dZ 


7(s -b Ac) -fizV 


Xz 


[81] 


According to the formula 80, 


/(, + ^).-Lf iwds 

2itj Js f — (2 + As) 


Substitution of Eqs. 80 and 82 into Eq. 8l gives 


^ - itait f 


m 

A2 


1 


f - (s + A2) 




[82] 


[83] 


1 1 A:; 

f - (s + As) f - s - zf - hz{i - z) 


[ 84 ] 


Now 
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so that the limit 83 yields 


— = — <£ 
dz 2irj Js (f — z)* 


[85] 


The second derivative, according to the form of Eq. 81, is expressed by 




w . rf 

T = limit — 
^-*0 _ 


(z + Az) - fizY 


As 


in which the prime denotes the first derivative. By Eq. 85 

fu + = _L <f , 

lifjJs [f — (s -h As)]^ 


[ 86 ] 


[87] 


and Efi- 86 therefore gives 
d. 

But 

1 


^ = limit f i ^ - ^ ■ ] di\ [88] 

2(f — s) As + As" 


I 


[f - (s + As)]-' a- - zY (f - s - As)-(f - S)- 
so that the limit 88 is found to yield 

^ A (fimiL 

dz^ IwjJs (f — s)^ 


[89] 


[90] 


Continuing in this way, one establishes the following formula for the 
nth derivative: 


^ ^ 111. (f 
ds" 2x7 Js (f - s 


[91] 


It may be concluded that the function uf = /(s), if regular within a given 
region, possesses derivatives of any order for all points of this region, and 
these derivatives may be obtained through differentiating under the 
integral sign in the Cauchy integral formula 80. The existence of but a 
single derivative thus implies the existence of all subsequent derivatives! 
The elegance of this remarkable result can hardly be overemphasized, 
nor is it out of place to call attention to the fact that a similar result does 
not apply to real functions. 

The values of the derivatives of higher order are seen to be unique if 
this is true for the value of the first derivative of a given function. The 
first and all higher order derivatives of a function of a complex variable 
are themselves functions of a complex variable, and these derivative func- 
tions are analytic at any point where the given function is analytic. 
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9. Point sets and infinite series 

Preliminary to the discussion of the expansion of functions in infinite 
series, which is given in the following article, it is helpful to \yo\nt out a 
few fundamental principles regarding the convergence of such series. 
This subject may best be approached through considering an infinite set 
of points 

^1, ^2, 5a, • • • • • • [92] 

Visualize the quantities 5 i, 50, etc., as p(')ints in the complex plane. They 
may be scattered about completely at random, or the sequence of their 
values may exhibit a tendency to become confined within a more and 
more limited area or region as the index k (identifying individual values 
in the sequence) becomes larger and larger. In the latter case the sequence 
is said to approach a limit and the set of points is said to possess a limit 
point. 

If one considers a number of concentric circles (with finite, nonzero 
radii) drawn about the limit |x>int S in the complex plane in which the 
points 5 i, 52, • • • are indicated by dots, each of these circles, however 
small, contains an infinite number of jxiints. This statement is the defini- 
tion of a limit point. Considering a particular circle of radius e, it is possi- 
ble to state that some integer n e.xists such that all {xiints Sk for k > n 
lie inside this circle. For any given radius e, the appropriate value of n 
must be sufficiently large to assure that none of the points in the unending 
sequence 5 ,i+i, 5,14.0, 5„4.3 • • • lie outside the circle (although some for 
k < n may still be inside) ; or, for a given integer n, the appropriate radius 
€ must be sufficiently large. As the value of n is chosen larger and larger 
the appropriate € may be chosen smaller and smaller since the points be- 
come denser and denser in the more immediate vicinity of the limit 
point. The latter, for this reason, is sometimes also referred to as a 
cluster point or as a point of condensation. 

It thus becomes clear that a limit 5 of the unending setiuencc 5 i, 50, 5.3, 
• • • may be said to exist if 

|5 — 5^1 < €, for all k > n [ 93 ] 

in which € is nonzero but may be chosen arbitrarily small, and n is a 
finite integer depending upon e. If this condition is fulfilled, the sequence 
is said to converge to the limit S. 

J An important result known as the Bolzano-Weierstrass theorem follows 
Trom the definition of a limit point. This theorem states that if an unend- 
ing sequence of points 5 i, 52, 53, ' is confined within a finite region, that 
region must contain at least one limit point. The truth of this statement 
may be seen through considering the region to be subdivided into a finite 
number of smaller ones, for example, squares. Since the number of points 
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is infinite and the number of squares finite, it is clear that at least one of 
the squares must contain an infinite number of points. This one may 
again be subdivided into smaller squares and the same reasoning repeated. 
Through continuing in this way one may state that within the original 
region it must be jx)ssible to find at least one nonzero but arbitrarily 
small subregion containing an infinite number of points, whence, accord- 
ing to the definition of a limit point, that subregion must contain one. 

The convergence condition 93 may be expressed in an alternate form, 
knowm as Cauchy s principle of convergence, which reads 

“ 5n| < €, for a finite n and all p = 1, 2, • • • co [94] 

According to this statement, a circle drawn about Sn with the finite 
radius e contains all for k > n; that is, it contains the unending 
sequence Sn^i and in view of the Bolzano-Weierstrass 

theorem it must, therefore, contain a limit point. 

('onsider now the infinite series 

S = 2 //« = /q + W2 “b ^^3 "}" • • • [95] 

and its partial sum 

Sfc = /q -f- //•> + ^3 + * * * + Wfe [96] 

For k = 1, 2, 3, • • • these partial sums may be regarded as elements of 
the unending sequence Si, 5^, -S3, * • • discussed above, and S as its limit. 
If this limit exists, the infinite scries 95 is said to converge; if the limit 
does not exist, the series diverges. Cauchy’s principle of convergence 94 
may be expressed in the form 

1w„.|.i+«,.+2+ • • • <€, for a finite n and all /^ = 1, 2, • • • 00 [97] 

The quantity \S — 5a] appearing in the condition 93 is the absolute value 
of the remainder of the series 95 after the Ath term. 

Beside the series 95 it is significant to consider the one expressed by 

(T = a |w«| = l^^il + 1^2] + 1^3] + • • • [98] 

n =1 

in which the terms arc the absolute values of corresponding ones in 95. 
Since the remainder 

+ Un+2 + Wn+3 + * * *( [99] 

of the series 95 is always smaller than the remainder 

l«n+ll + l^n+2l + + * * * [100] 

of the series 98, it follows that 95 is surely convergent if 98 converges. 
In the latter event the series 95 is said to be absolutely convergent. 
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A sufficient test for the absolute convergence of the series 95 is ex- 
pressed by the condition 

limit < 1 [101] 

n— » 30 

To prove this statement one should first obscree that no matter how the 
limit of is approached as n is allowed to become larger and larger, 

whether monotonically from above or from below or in an oscillatory 
fashion, the fact that the limit is less than unity enables one to state that 
there must always exist a finite value of n such that for k > 

^ ^ p < 1- Denoting the partial sums of the series 98 by <rn, one 
may write 

lo-n+p — ^^ril = j^^n+ll+ l^'«+2|+‘ * *+ |Wn+p| ^ p”"*”* +p”*^" + * * • + [l02] 

or 

(\ p\ 

|<T„+p-<^nl^p"^'(l+p+P^+- • •+P'^*) = P"+' TT-^ ^f — [103] 

(1-p) 1— p 

If one chooses p”'^\/(l — p) = €, then € can be made arbitrarily small 
through the choice of a sufficiently large n. Therefore the condition 101 
leads to the result 

\<Tn^p — o'nl < ffir a finite n and all /> = 1, 2, • • • oo [104] 

which is Cauchy^s condition 94 for the convergence of the series 98. 

If approaches a limiting value p > 1 as n becomes larger and 

larger, the series 95 diverges since |w„i — ♦ p" > 1 for large w, in violation 
of the obvious necessary convergence condition ► 0 for n — ► x . 

Sometimes the test 101 , for one reason or another, is not applicable and 
one must employ other means for examining the convergence of a series. 
A useful alternate method is the d'Alembert ratio test which is expressed 
in the statement: If in addition to \un\ — > 0 for n —> x , one has 

^ <p < lforaU)fe>n [105] 

Uk 

the series 5, Eq. 95, is absolutely convergent. 

To prove that this condition insures the fulfillment of the Cauchy 
condition 94, one may begin by observing that 

|Mn+l| < |Mn|p 
|«n+2| < |«n+l|p < |«n|p* 

|m„+ 3| < |«„+i|p^ < |«„|p® 

etc. 


[ 106 ] 
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Hence 

i (Tn+p — O-n I = |Wn+l I + lWn4-2| “!“• * ‘H-IWn-f-pi < ! (P + P" + P^ + * * -^P^) 

[107] 


or 

l^n+p <^n| h^n| X ~ <C jwnj Z 

1 — p 1 — P 

Since j//nj — > 0 for one can always find a value of n beyond 

which //„'p (I — p) is equal to or less than an arbitrarily small nonzero e. 
Hence the C'auchy criterion is met. 

Another useful method is the so-called comparison test^ according to 
which the given series 95 is compared with another series 

X 

§ = 2^ I’n = Vi + ?2 4" 4- • • • [11^^] 

n =1 

Thus if 5» is knowni to be absolutely convergent and !//^| < where 

C is any finite |X)sitive constant, the series 5 must be absolutely conver- 
gent. The series S is referred to as the dominant of 5 (or is said to dominate 
the series S). 

Trivial as this test may seem to most engineers, it is nevertheless very 
useful, particularly when both the preceding methods of testing for 
absolute convergence fail. An example is furnished by the series 


Here 


1 with — > 00 for all finite i 
This last result may be seen by observing that 

'"Cr-G'-o 


[111] 


[ 112 ] 


Since the quantity in the parenthesis becomes zero for w — ^ oc the result 
111 follows at once. It is clear that the condition 101 is not met for any 
finite nonzero r, and yet one cannot say with certainty that the series 
diverges, since inequality 101 is a sufficient, but not necessary, condition 
for convergence. 

The criterion 105 for the d’Alembert ratio test likewise leads to an 
indecisive result, namely, one has 


Un+l 

Un 


[ 113 ] 
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The convergence of the series 110 may be investigated through first 
determining another series which dominates it. To this end one writes 



and so forth. It is clear, therefore, that the series 

§ = L = 1 + u'-o -f (2'-0" + • • • [116] 

n =0 

dominates the series 110 or 114. The right-hand side of Eq. 116 has the 
form of the power series 

5 = f c" - 1 4- + c*' + • • • [117] 

n =0 

for which so that the condition 101 reads 

H < 1 [118] 

With regard to the series 116 the corresponding condition leads to 

<1 or r > 1 [119] 

When this condition is fulfilled, the series 1 10 converges. 

For r = 1 this series reads 

5 = 2 ' = l+ § + ^' + j + ’’' [120] 

n=o n 


and 


kn+P 5n[ — 


« -f- 1 W + 2 


+ 


W -f 


n At p 


[ 121 ] 


For no finite w, however large, can this quantity be less than an arbi- 
trarily small (nonzero) e for all /^ = 1, 2, • • • <» , since for p > n its value 
approaches unity. Therefore the series is divergent for r = 1 ; and it is 
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certainly divergent for smaller values of r since the terms in the corre- 
sponding expression 121 then are all larger than they are for r = 1. 

Absolutely convergent series have the important fundamental property 
that their values are unaltered through a rearrangement of the terms. A 
convergent series that is not absolutely convergent is referred to as being 
conditionally convergent. The value of such a series can very definitely 
be altered through a rearrangement of its terms. In fact Riemann has 
shown that a conditionally convergent series can be made to have any 
finite value for its sum through an ai)propriate grouping of the terms. 

For example, the series 

5 = Z --;f— = 1 - I- + i i + i- - + • • • [122] 

is evidently not absolutely convergent since the sum of the absolute 
values of its terms is the scries 120 which is divergent. It is not difficult to 
see by inspection, however, that the series docs approach a finite limit 
in an oscillatory fashion. This limit may be computed to any desired 
accuracy, taking the tenns in their given order. One thus finds 
5 = 0.69315 • • • which, by other means, may be shown to be the decimal 
fraction approximation to In 2. If the terms are now rearranged as follows 

5 = 1 + 3 - i + -5 + 7 “ i- + 0 + iV “ i + * * • [123] 

and the limit is again computed, one finds 

5 = 1.03972 • • = 1.51n 2 

When a given series is not absolutely convergent, its possible condi- 
tional convergence may be investigated in the following way. Let the 
series be written in the form 

5 = L [124] 

n=l 

in which (/„ and are any two parts into which the typical term of the 
series is separable (one may need to make several trials at this step in 
order to find an appropriate separation). Defining the quantities 

§1 = ax 

§2 ~ + ^2 

§3 = <0 + <3^2 + ^3 [125] 


§n = ^1 + ^2 + • * * + 

one may rewrite the series 124 as 

S = §i(t'i — V 2 ) + ^ 3 ) + §3^ — ^^ 4 ) + • • • [126] 
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with the partial, sums 

‘S'n = §1 (^^1 + * * * +Sn-1 (^'n— 1 ""^^n)+Sn^^n [^27] 

Thus one finds 


‘S'n+p 

- 5„! = |-S„t’n + §n(:’n “ 

I’n+l) + 


+ Sn+p-1 0 

''» + P— 1 


<1§«! • '‘nl 

4 - |§;i| 


+ |§n+p— 1 1 



Xow if 

|§&| = + ^2 + ^3 4" * * • + ^k\ < A for 3^11 k [129] 

with A finite, 

;*5'n-|.p •S/jj .c'l t’„ j -f- “[■ (j^^n ! 4" 

+ • • • + [130] 

N’ext, if 


<^ = Z Fn ~ convergent [131] 

1 

so that 

!<^«4-p ^’n+1 j 4" jt'n+l ‘’n+2i4“ *** 4" {t’n-f p— 1 [l'^2] 

for a finite n and all /^ = 1, 2, • • • x , and if further 

!c-„+p’ < €2 for all /> = 0, 1, 2, • • • [133] 

the condition 130 becomes 

LSn^p — 5",, I < A(ei 4 - 262 ) = 6 [134] 

for a finite n and all /^ = 1, 2, • • x . The series 124 is then convergent, 
and the conditions 129, 131, 133 constitute a test (known as the Dedekind 
test for conditional convergence) which may be used to reveal this fact. 

As an illustration the method may be applied to the scries 122. Here 
one may let 

fln = (-1)"-*, [135] 

n 

Then Sn = 1 or zero and hence remains bounded for all values of n 
(condition 129). Terms in the series 131 have the form 

\Vn - Vn+l\ = - - ^ 1 ) [ 136 ] 
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This typical term is smaller than l/»^, and since the series with X/r? as 
its typical term is convergent, the comparison test reveals the convergence 
of the series 131 in the present example. Finally the condition 133 is 
evidently met since Vn in 135 approaches zero monotonically vvith in- 
creasing n. Thus the conditional convergence of the series 122 is proved. 

In the majority of problems in which infinite series appear, the terms 
of the series are functions of some independent variable, and its sum is 
likewise regarded as a function of this variable. For example, in the power 
series 117 the quantity s may be assumed to have any complex value. 
It does not necessarily follow, however, that the infinite series represents 
a function of this complex variable since values of the series are not 
related to values of the variable unless the series is convergent. 

If for all points of a given region in the s-plane the series is convergent, 
it is said to be uniformly convergent in this region, and the series may 
then be regarded as there representing a function of the complex variable 
c. As the condition 118 shows, the power series 117 is convergent for all 
fx^ints lying inside the unit circle about the origin. This is its region of 
absolute and uniform convergence. If the power series is written 
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if the variable z is replaced by (s — Co) except that the convergence circle 
is then centered at the point = Sq. 

To say that an infinite series converges unifonnly at some point z = Z\, 
is equivalent to saying that the series converges not only at this point 
but also for all jxiints within a circle about Zi with a nonzero but arbi- 
trarily small radius. When this condition is met the term-by-term deriva- 
tive of the series yields a resultant convergent series that correctly 
represents the derivative of the function defineil by the given series, and 
an analogous statement may be made with regard to term-by-term 
integration. Within a region of uniform coin ergence one may, therefore, 
carry out the differentiation or integration of a function defined by an 
infinite series through applying the identical process to each term and 
summing afterward. The resulting series, however, may or may not 
possess the same region of uniform convergence. 

Power series have the interesting property that their term-by-term 
derivative or integral yields resulting series with the .same region of uni- 
form convergence. This fact may easily be proved, d'he series obtained 
through differentiation and integration of 137 are res|)ectively 

= 2 : and 5(-» = £ -^- ;"+i [141] 

-f 1 


The condition 101 applied to these series reads 


limit < 1 and limit 

n — > * 7! — » * 

Since 


1 1 / n 


n -f 1 




limit = limit 


1 


11 -h 1 


1 In 


= 1 


[143] 


(as may be seen through considering the logarithms of the expressions 
subjected to the limiting process n—*~r.) it becomes clear that the con- 
ditions 142 coincide with the condition 138, and hence that the series 
141 have the same convergence circle as the series 137. 

These matters pertaining to uniform convergence of series may be 
illustrated through considering the straight-forward expansion of some 
simple algebraic functions. For example, if one divides (1 -- s) into unity 
by continuing the process of long division, there results 

-I— = I + z + + z* + ••• . [144] 


which is the power .series 117. For values of z within the unit circle this 
series identically replaces the function 1/(1 — s), whereas for values of 
z outside this circle the series and the function 1/(1 — s) have entirely 



Ar/. P] 


POINT SETS AND INFINITE SERIES 


285 


different values. Near the point z — \ both the series and the function 
have values that increase without limit, whereas at the point 2 = — 1, 
Eq. 144 yields 

i = l + + + [145] 


The scries is evidently divergent at this point, for it fails to approach a 
limit. Hence there is no reason to become puzzled about this result since 
the function and the series may be identified only where the latter 
converges. 

However, in view of the fact that conditionally convergent series, 
through a particular arrangement of their terms, may be made to yield 
any desired sum, one is led to inquire whether a divergent series like the 
one in Kq. 145 may nevertheless be summablc through the application 
of some special process. 

This question, in the past, interested a number of mathematicians — 
Cesaro, Holder, Euler, Abel, Borel, etc. — and the summation procedures 
developed by them are connoted by the letters C, H, E, A, B, etc., 
respectively. Thus if, for example, a particular series is summablc through 
applying the Cesaro procedure, one abbreviates this statement by saying 
that the series is C'-summable. This particular process of summation is 
now discussed in some detail. 

With reference to the series 95 and its partial sums 96, consider the 
sequence 

5i = = Ux 


S2 


<^ 1 ~ f ~ -^2 



S3 


-Jo “h <^3 

”5 


2 1 


[ 146 ] 


o -^1 4- ^2 + • • • 4- -Ju , 2 \ I ^ 1 

n \ nj \ nj n 

which is referred to as an arithmetic mean sequence of the first order (if 
this sequence is subjected to the same process the result is said to be of 
second order, etc.). A limit of the sequence 146 exists so long as Sn 
remains finite for unlimited values of n. It is not necessary that the terms 
of the series 95 have the property 0 for w — » 00 , but, if they do, one is 

led to the result 

limit |i<i + (1 - - ) «2 + ( 1 - - ) «3 H 1- - «»| = limit (j„) [147] 

> 00 [ \ n/ \ nf n J n— ► » 

That is, the new sequence then apparently has the same limit as the 
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sequence of partial sums a conclusion that should, however, be ac- 
cepted with obvious reservations. 

For the series in Eq. 145 the partial sums are 

Si = 1, 52 = 0, S:i = 1, 54 = 0, • • • [148] 

from which it is clear that 

limit (5J = limit - iL£l'| == ^ r| 4 g] 

Thus the left-hand member of Eq. 145 is interpreted as the Ces^ro sum 
of the divergent series on the right. This type of summation wil! be 
used in connection with Fourier series in the following chapter. 


10. Taylor’s axd Maclaurix's series 


The point of departure in the present argument is Eq. 79, which is 
repeated below: 

Here the contour integral is assumed to be evaluated for a circular ])ath 
C with its center at some point Zi^. Writing 

_J 1 

f Z (s* Co) (C " Co* 

one obtains, by the process of long division, the .series 
1 1 . c - Co , rc - Co)- 

‘ D } 


f 


f ““ -0 (f ~ ^o)‘ 


(i 


[1511 


[152] 


which, according to the theory of infinite .series discussed in the previous 
article, is known to be uniformly convergent for 


< 1 

f - So 


ri55] 


Since the center of the circular contour C lies at the points Zqj and f lies 
upon the circle, this condition for uniform convergence is fulfilled for all 
points z within the circle. Because of its uniform convergence, the 
series representation for l/(f ~ z) may be substituted into the integral 
Eq. 150. and the integration carried out term by term. This process gives 


e- (r - Zo)'^ 

/(f) dt 


^2 ^ / f)«f . 

+ (2 - ZnY f 77 ^3 + 

-Z'’ (f - So)-’ 


[ 154 ] 
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A series representation for the function w = f(z) is thus obtained which 
reads 

f{z) = ao + ai (2 ~ zo) + aaCz - 2o)^ + ^ 3(2 Zof + • • • [155] 

with 




2TjJc (1 - 


[156] 


Equation 155 is recognized as Taylor’s series representation for the 
function f{z) in the vicinity of the point So, and Eq. 156 yields the 
coefhcients for this expansion. Since the validity of Eq. 150 requires 
that the function f{z) be regular throughout the region enclosed by the 
contour C\ and the series 152 or 154 is uniformly convergent only for 
points within this circular contour (because of the condition 153), it 
follows that the Taylor series 155 converges uniformly only within a circle 
about 2,1 whose contour reaches to the nearest singularity of the function 
Xc- = fiz). 

It is useful to note that, by means of Eq. 91, the relation for the 
coefik'ients of the Taylor scries expressed by Eq. 156 may be written in 
the alternate form 



which is recognized to agree with the familiar form used for the expansion 
of functions of a real variable. The Taylor scries is thus found to be avail- 
able in unaltered form to functions of a complex variable. The Maclaurin 
series evidently applies also to functions of a complex variable, since it 
is identical with the Taylor series for the special case Zo = 0. 

If the Taylor series, Eq. 155, is differentiated term by term, it is 
obser\'ed, according to the formula 157, tha the coefficients in the re- 
sulting series are those for the expansion of the function df dz about the 
point z = Zoy and hence that this result represents the Taylor series for 
the derivative of f{z). The truth of this statement may be shown through 
starting with Eq. 150, dilTerentiating under the integral sign (as per- 
mitted according to the conclusions of Art. 8), and substituting the square 
of the series 152 for the quantity I' (f - 2 )“, obtaining in place of Eq. 
155 a series for the function df;dz. 

One may conclude from this reasoning that the term-by-term dif- 
ferentiation of the Taylor series is permitted and yields the Taylor series 
representation for the derivative of the given function. Moreover, the 
resulting series has the same region of convergence as that for the given 
function, as may be seen from the discussion in the previous article or 
from the fact (brought out in the dosing sentence of Art. 8) that the 
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derivative of a function of a complex varial^le is aji;ain a function of a 
complex variable and possesses the same region of analyticity. From an 
obvious continuation of the same argument one sees that these conclusions 
apply to derivatives of any order. 

Collateral to these considerations is the fact that the Taylor expansion 
of a given function is unique. To demonstrate this fact, assume that a 
series representation having the form of Eq. IvSvS is given, and that this 
series is somehow known to represent the function f{z) within an arbi- 
trarily small region in the immediate vicinity of the point z = Co. The 
given series must be the Taylor expansion for /(c) about the point c = so, 
because the behavior of the function in this vicinity completely de- 
termines its successive derivatives there. That is, dilTercntiating the given 
series and substituting the results into Eq. 157 demonstrates that the 
coefficients are identical with those of a Taylor expansion about the same 
point. 

11. The principle of an.alytic contixu.ation 

One of the most interesting things about the theory^ underlying the 
Taylor expansion is that the mere knowledge of a function within an 
arbitrarily small region in the vicinity of some point Zo completely de- 
termines that function at all other points within the convergence circle 
about that point. The purjx)se of the present article is to call attention 
to the even more remarkable fact that a function of a complex variable 
is determined throughout the entire c-plane* from a knowledge of its 
properties within an arbitrarily small region of analyticity. t 

The first step in the process of carrying out such a determination is to 
write down the Taylor series about the point Zq where the function and 
its successive derivatives are known. Some other point s'o within the 
convergence circle for this Taylor series may then be chosen as the center 
about which a new Taylor series is determined. This choice can always 
be made, because the original series may be used for calculating the values 

•This statement is subject to some restriction when the function in ciuestion possesses 
what is known as a natural boundary, every point of which is a singularity. In such an c.xample, 
either the function does not exist beyond this lx>undary or its behavior there is governed by 
an entirely separate definition, in which ca.se it is perhaps more appropriate to say that one 
is dealing not with a single function but actually with two separate functions. In order for 
the reader to appreciate that these ideas are not merely of an academic nature, his attention 
is called to the fact that natural boundaries of the sort referred to here dp occur in practical 
problems. For example, the functions representing the fields in the cross-sectional plane of 
a wave guide possess as their natural boundary the walls enclosing these fields. 

fit is not even necessary to know the function at all points within an arbitrarily small 
region about zq; it is sufficient to know the values of the function, in the interior of the region, 
for all points of a finite but arbitrarily small line segment, or in a set of discrete points which 
have a limit point. 
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of the successive derivatives of the function in the point Unless every 
point on the original convergence circle is singular, the point can always 
be so chosen that the new convergence circle about js'o encloses a portion 
of the region lying outside the original circle. Since the two circles partially 
overlap, the tw'^o Taylor series have a certain convergence region in com- 
mon where the function is determined by either series. The second series, 
however, enables one to calculate values of the function at points within 
a circumscribed region lying beyond the boundaries which limit the 
representation of the function by means of the first series. One speaks of 
this process as an analytic continuation of the function into the newly 
acquired region. 

By properly choosing a third point s"o within this newly acquired 
region and using the second series for the calculation of the successive 
derivatives of the function in this point, one obtains a third Taylor 
series whose convergence circle encloses a portion of the s-plane which 
lies beyond the boundaries given by either of the first two convergence 
circles. The third Taylor series, therefore, represents a further continua- 
tion of the same function. 

In order to obtain such a further continuation it is, of course, not neces- 
sary that the third jx)int z^'q be located outside the first convergence 
circle, since the selection of this point may be regarded merely as a re- 
vision in the choice of the second point s'q. It should be easy to appreciate 
that this procedure may be continued in a variety of ways so as to obtain 
numerous overlapping convergence circles and corresponding series 
representations by means of which the function is ultimately determined 
within any desired region of the c-plane except for the singular points 
of the function . 

The process of carrying out an analytic continuation in this manner 
and of obtaining a succession of partially overlapping convergence circles 
which extend the known region for the fu.iction into a continuously 
expanding p>ortion of the ::-plane may be regarded as a process of succes- 
siv ely acquiring access to additional area in the s-plane after the fashion 
that a harvester, cutting grain with a scythe, successively acquires 
additional stubble ground by executing a continuous series of semi- 
circular slices. If for the moment one imagines this harvester to be doing 
a rather unsystematic job, it is conceivable that he may cut a swath or 
path which circles about a portion of the grain field and returns so as to 
overlap itself. If the process of analytic continuation is carried out in this 
manner, one expects, in the overlapping portion, to regain the same values 
of the function as were obtained previously. This, however, may or may 
not be the case, for the function may be multivalued, and in returning to 
the original portion of the s-plane one may find oneself located on a dif- 
ferent leaf of the Riemann surface’*' characterizing that multivalued 

*These matters are discussed in further detail in Arts. 17 and 18, 
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function. In fact it is possible to find, after returning to the original 
region, that a point at which the function was observed to be regular 
when this region was first encountered is now a singularity of the function. 

The significent matter here is that the process of analytic continuation 
is applicable whether the function is single- valued or not, aiul that even 
a multivalued function with all its manifold characteristics and peculiari- 
ties is completely determined from a knowledge of that function over an 
arbitrarily small region of analyticity on a single leaf of its Riemann 
surface. 

It follows also from these considerations that any two functions of a 
complex variable whose values coincide over an arbitrarily small region 
of analyticity, or for all points of a finite but arbitrarily small line segment, 
or for a set of discrete points having a limit {K)int, must have identical 
values throughout their common region of analyticity and hetice must 
there be identical. This statement is known as the identity theorem or 
uniqueness theorem for analytic functions. 

If the common region of analyticity }X)ssesses a natural boundary, 
nothing is implied regarding the behavior of the functions within other 
possible regions of analyticity. The theorem likewise does not imj)ly 
that the behaviors of the two functions are identical at isolated singulari- 
ties located within their common region of analyticity, but these matters 
are practically irrelevant inasmuch as a function is usable only where it 
is analytic, and no difiiculty can arise from assuming (if this be desirable) 
that the functions are identical everywhere else. 

The arbitrarily small line segment over which the function is initially 
known may be a portion of the real or imaginary axis in the c-plane. A 
real function of a real variable may be regarded as a function of a com- 
plex variable w'hose independent and dependent values both happen to be 
on the real axis. If it is possible to continue such a function into the 
complex domain, that continuation is unique and is immediately obtained 
by the simple expedient of replacing the real independent variable x by 
the complex variable z = x d-jy. The truth of this statement, expressing 
a property of functions known as their permanence of forniy follows di- 
rectly from the identity theorem, since the given real function and its 
continuation obviously coincide for points on the real axis. 

12 . Singular points and the Lalt^ent expansion 

Since the singularities of a function of a complex variable are, so to 
speak, the mainsprings upon which its very existence depends, it is usually 
of chief interest in the study of a given function that characteristics of 
the function be investigated in the immediate vicinity of these singular 
points. For this purpose the Taylor series is of little service, because the 
vicinities of singular points are the very regions where its convergence 
fails. Consequently it is of considerable importance to search for a type 
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of expansion whose sphere of usefulness is centered about a singular point, 
that is, an expansion which places the character of a given singularity 
in evidence. 

A few preliminary remarks concerning singularities will be appropriate 
preceding the detailed discussion of how such an expansion is found. 
Quite generally one must admit the possibility that a given function may 
be singular (nondilTerentiable or nonanalytic) not only at certain discrete 
jK)ints but also at all points comprising a finite region in the s-plane. 
If the latter is the case, and a particular point within this region is 
singled out, it is not fxissible to discover an immediately surrounding or 
neighboring space, however small, in which the function is analytic. 
In other words, the singular jx)ints within such a region are infinitely 
dense. 

For the considerations of the present article, such singular regions 
must be ruled out. Indeed, it is not {K>ssible to discover any kind of series 
expansion which can rejiresent a function in the vicinity of a pioint lo- 
cated within a region of this sort. From a practical point of view this is 
hardly discouraging, however, inasmuch as functions which possess such 
singular regions are seldom encountered in engineering work.* The present 
(lisi'ussion, then, will apply only to singularities for which it is possible 
to discover an immediately surrounding region in which no other singular 
jK)ints lie. 'fhese are (ailed isolated singularities. 

With reference to I'ig. 9, Zo represents a point at which a function 

= f{z) has an isolated singularity. About this point as a center are 
drawn a small circle c and a larger one C. The given function may have 
other singularities within the smaller circle or outside the larger one, but 
it is assumed to be analytic and single-valued at all points such as the 
.point z inside the annular space between the two circles. Cauchy's in- 
tegral formula, Kci- 79, is, therefore, applicable to the composite contour 
consisting of the two circles and the path j aning them, traversed con- 
tinuously as indicated in Fig. 9. Since the path joining the circles is 
traversed in both directions, the resulting form of Eq. 79 maybe written 


2TjJ{z) 


£ 

X (f - =) 


+ 


/(r) rfs- 
(i- - =) 


[158] 


in which the larger circle is traversed in the counterclockwise direction, 
the smaller one is traversed in the clockwise direction, and f refers to 
points on either of the two circles. 

For the integral over the large circle, the series 152 for l/ (f - s) 

*It may be [xiinted out in this connection that in terms of the analogy to physical flow 
fields (as discussed in .Vrt. S ) , the continuously distributed sources or vortexes (spatial distribu- 
tions of electric charge or current densities, for example) constitute just such singular regions. 
However, the solution of practical problems dealing with these situations fortunately does not 
require the series representation of functions within these regions. 
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converges uniformly because the condition 15v^ is fulfilled. For the integral 
over the small circle, this statement is no longer true, because the point 



Fig. 9. The composite contour to which Cauchy's integral formula is applied 
in the derivation of the Laurent expansion. 


z lies outside the small circle and hence 'c — Coj > |f — Co|. Here Eep 
151 must be replaced by 


_J -J 

f -o) ■“ (f ~ 2o) 


[159] 


whence a process of long division yields the series 

l_ 1 _ (j: - ^i)) _ - ::o)^ 

f — a z — Zo (a — au)~ (a — ao)^ 


[160] 


which is uniformly convergent for 


f 


~ -0 


-0 


> 1 


[161] 


It thus becomes clear that for 1 (f — a) one may substitute the series 
152 into the first integral in Eq. 158 and the series 160 into the second 
integral, and carry out both integrations term by term. Equation 158 
then becomes 


2rif(z) = i 


' £ MlH- 

(f - zo) ^ (r - zo)^ 

/(f) dk 


-iz-zor^f 


(f - Zo)-=' 


[162] 
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This result may be written in the form 
/(c) = A- bi(z — So) + ^ 2(2 ~ - 0 )^ + ^ 3(2 — So)^ -}-*•* 

+ i(s — So) ^ + 6—2(2 — So) ^ + 6_3 (s — So)~^ + • • * [ 163 ] 

in which 

'”'■-'>.>.2.- [i«) 

and 

(I- ’J- lor „ - - 1. -2, -3. . . . [165] 


Kquation 163 is the desired series representation for the function w = f(z). 
It is called the LauretU series. From the derivation just given it is clear 
that the series converges uniformly within the annular region between 
the two circles shown in Fig. 9 where the function w is analytic. Thus the 
radius of the outer convergence circle extends from Sq to the nearest 
singularity beyond the circumference of the smaller circle, and the radius 
of the inner convergence circle extends from Sq to the farthest singularity 
inside the larger circle. If the singularity at Sq is the only one inside the 
larger circle, the radius of the smaller circle may become vanishingly 
small, and the Laurent series is then seen to represent the given function 
in the immediate vicinity of the singularity at Zoy that is, to represent an 
ext)ansion of = f{z) about this singularity as a center. 

In connection with the formulas 164 and 165 for the coefficients of the 
Laurent scries it sht)uld be observed that the direction of traversal about 
the large circle C in the integral 164 is counterclockwise, whereas that 
about the small circle c in the integral 165 is clockwise. If the latter direc- 
tion of traversal is reversed, the algebraic sign in Eq. 165 changes from 
minus to plus. The formulas 164 and 165 then differ only in that the first 
is evaluated for the circumference of the larger circle and the second is 
evaluated for the circumference of the smaller one. Since the function 
w = f{z) is analytic in the region between the two circles, the values of 
the integrals 164 and 165 are the same for any closed contour within the 
annular region or coincident with either circle. Hence the formulas 164 
and 165 may be replaced by a single one which reads 


b 


n 


J_ 

ItJJs (f-Sor+‘ 


[ 166 ] 


in which 5 is any dosed contour within the annular region or coincident 
with either circle, f refers to points on this contour, and the latter is 
traversed in the counterdockwise direction. Since the function w = f{z) 
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is singular in the point So, Eq. 91 does not apply for z = Zq and hence 
there exists no alternative differential formula for the cocflicicnts 
similar to Eq. 157 for the coefficients of the Taylor series. This circum- 
stance is a practical disadvantage because integration is, as a rule, more 
difficult than differentiation. Consequently, when a Laurent expansion is 
to be found, the coefficients are determined, wherever possible, by other 
expedients. Further discussion of this point is given later on. 

When the form of the Laurent series, Eq. 163 is contrasted with that 
of the Taylor series, Eq. 155, the principal difference is obser\’ed to lie in 
the fact that the Laurent series contains both descending as well as 
ascending powers of the variable (z — Co), whereas the Taylor series 
contains only the ascending powers. The jwrtion of the Laurent series 
involving the ascending powers only is called the ascending part of the 
series, and the portion inv’olving the descending powers is called the 
descending part or principal part. It is the princij^al part of the Laurent 
series which places in evidence the singularity of the function w = f(z) 
at the point ro- 

The ascending and descending parts of the Laurent expansion may be 
written respectively as 

Mz) = b„{z - Zq)’* [167] 

n »0 

and 

M^) = X b„{z - Zo)" [168] 

n => —1 

whence Eq. 163 for the Laurent series becomes 

m ^Mz) +f2(z) [169] 

Because the series 152 (which leads to the ascending part fi) converges 
uniformly for all z- values within the larger circle and the series 160 (which 
leads to the descending part / 2 ) converges uniformly for all z-values out- 
side the smaller circle (Fig. 9), it follows that the series 167 converges 
everywhere within the larger circle whereas the series 168 converges 
everywhere outside the smaller one. Both series converge within the an- 
nular region between the two circles (this is the common portion of the 
two separate regions of convergence), and this, therefore, is the region of 
convergence for the sum of the two series 167 and 168, which is the 
Laurent expansion. 

The function /i(z) is analytic everywhere within the larger circle, and 
the series 167 is its Taylor expansion about the point z = zq. The function 
/ 2 (z) is analytic everywhere outside the smaller circle. To interpret the 
series 168 for/ 2 (z), it is helpful to consider for the moment a change of 
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variable which amounts to replacing {z — Zq) hy w 1/(2 — Zq) which 
amounts to interchanging the roles of points Zq and oo. The series 168 
then becomes one involving only ascending powers of the new variable 
w and represents a Taylor expansion of the function /2(s) about the point 

= 0 (which corresponds to a Taylor expansion about the point 2 = oo 
according to the change of variable considered). It is helpful in this 
reasoning to think in terms of the complex sphere rather than the complex 
plane, for then the points z = Zq and 2 = ^ are simply two points on the 
sphere and the interchange of the parts which they play is easier to 
visualize. One is thus led to recognize that the descending series 168 may 
be regarded as a Taylor expansion of the function 72(2) about the point 
2 = 00. 

According to this interpretation the Laurent series represents the given 
function by means of two Taylor series, one about the point 2 = 20 (this 
is the ascending series for /i) and one about the point 2 — » (this is the 
descending scries for /o). The function fi{z) contains only those singu- 
larities of f{z) which lie outside the larger circle about 20*, /2 (2) contains 
only those singularities of f{z) which lie within the smaller circle. This 
circle may alternatively be regarded as one which is drawn about the 
point 2 = oc as a center, whence the region “ within the circle (actually 
the region outside the smaller circle) becomes the region of analyticity 
for the function /2(2). The change of variable indicated by (2 — 20) 

I, (2 — 2o) evidently interchanges not only the points 2 = 2o and 2 = w 
but also the roles played by the functions fi{z) and 72(2) and their series 
representations as given by Eqs. 167 and 168. 

From the uniqueness theorem discussed in the preceding article it fol- 
lows that the Laurent expansion is unique, since two series representations 
having identical regions of convergence and yielding identical values for 
all points within this region must be identical. This fact has practical 
significance, particularly with regard to the Laurent expansion, because 
it means that if a representation having the form of Eq. 163 is found for a 
given function in the vicinity of one of its singularities, this representation 
must be the Laurent series for that region irrespective of the method by 
which it is determined. In other words, the coefficients need not be 
calculated by means of the formula 166, but may be found by any other 
expedient which proves to be most effective in the given circumstances. 

13. Kinds of singularities and the classification of functions 

IN TERMS OF THEM 

If the smaller circle c of Fig. 9 encloses no other singularities except the 
isolated one at the point 2 = 2©, the principal part (Eq. 168) of the 
Laurent series (Eq. 163) characterizes the nature of that singularity. If 
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the principal part contains a finite number of terms, the singularity is 
referred to as a pok. The principal part may then be written 

h(z) = + • • • + [^70] 

in which the highest negative exponent 5 is called the multiplicity ox order 
of the pole. A pole of the first order is also called a simple pole. 

The principal part h{z) may have an infinite number of terms, in which 
case the function f{z) is said to have an essential singularity at the point 
z = zq. At a pole the function f{z) becomes infinite, but in the vicinity 
of an essential singularity the function may assume any assigned value 
depending upon the manner in which this singularity is approached.* The 
function w = for example, has an essential singularity at the point 
s = oof, and the function w = has essential singularities at the 

points 2 = 0, T, 2t, • • • . For the function 

f{z) = e* = ^ (cosy +j sin y) = w + jv [171] 

one may recognize this peculiarity by noting that 

rj72] 

and 

tany = - [173] 

u 

Consi<3fer an arbitrary choice of values u and v. It is then possible to allow 
2 to approach infinity along a path, parallel to the y-axis, designated by 
X = In Vu^ + since this value of x satisfies Eq. 172. Equation 173 
may also be satisfied along this path for an infinite number of values of y 
which tend to infinity. Also, it approaches infinity along the negative 
real axis, e* becomes zero, and if z approaches infinity along the positive 
real axis, becomes infinite. 

Not only is a pole a milder form of singularity, but the behavior of the 
function in its vicinity is a very definite one. For a pole of multiplicity 5, 
multiplication of f{z) by the factor (2 — 20 )* yields a function which is 
regular in the point 2 = 20 . This circumstance may be regarded as a test 
whereby an ordinary pole may be distinguished from any other kind of 
singularity. 

Other kinds of singularities, particularly those found in multivalued 
functions (branch points), are discussed in a subsequent article. In the 
meantime it is useful to point out how certain types of single- valued func- 

*Theorein of Casorati-Weierstrass. 

fThis characteristic of the function e* may be recognized from the fact that the Taylor 
series, which in this case converges uniformly in the entire z>plane, has an infinite number of 
terms. Note also the discussion of entire funcUons immediately following. 
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tions may be classified according to the poles or essential singularities 
which they possess. Once more it is emphasized that the singularities of 
a function are the mainsprings of its existence. Without singularities of 
any kind, an analytic function reduces to a constant. 

In this classification one may begin with that type of function which is 
singular only in the point at infinity. Such a function is regular in the en- 
tire finite z-plane. It is called an entire function or also an integral func- 
tion, and it may be denoted by I(z). There are two kinds of entire 
functions; for one of these the singularity at infinity is an ordinary pole, 
and for the other it is an essential singularity. The first of these functions 
is more particularly referred to as an entire rational; the second, as an 
entire transcendental function. 

Since the entire function is regular in the entire finite z-plane, it pos- 
sesses a Taylor series representation for all finite points, which con- 
verges uniformly within the entire 2-plane. If the function is transcen- 
dental, such a Taylor series contains an infinite number of terms. The 
functions sin 2, cos 2, and e^ are conunon examples. An entire rational 
function, on the other hand, possesses a Taylor series representation 
having a finite number of terms, the highest power of (2 — 20) being equal 
to the order of the pole at infinity. The entire rational function, therefore, 
is an ordinary (finite) polynomial; that is, 

7(2) = P(z) = Oo + fli(2 - 2o) + ^2(2 “ ZoY H h an{z - 20)’' 

[ 174 ] 

in which n is the order of the pole at infinity. For n = 0 , the function is 
also regular at infinity, and in this case reduces to the constant Oq. 

A second important class of functions are those referred to as mero- 
morphic. They may be defined as given by the ratio of two entire fimc- 
tions; thus 

Since 72(2) is a finite or an infinite polynomial, it may become zero at a 
finite or at an infinite number of points in the z-plane. If 72(2) is thought 
of as factored in terms of its roots (these are called the zeros of 72), it 
becomes clear that, at these points, M(z) has ordinary poles whose 
orders equal the multiplicities of the roots of 72(2). The factored form of 
72(2) is its finite or infinite product representation, and this form places 
the poles of the function M (2) in evidence. 

At the point infinity the meromorphic function has an essential singu- 
larity if either or both of the entire functions 7 i and 1 2 are transcendental. 
Af (2) is then said to be transcendental also, but at no finite points in the 
2 -plane can this function have singularities other than ordinary poles. 
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The function = tan s is a common example of a meromorphic function 
which is also transcendental. It has an infinite number of simple poles in 
the s-plane and in addition has an essential singularity at infinity. 

When both Ii and I2 are rational (given by finite polynomials), the 
resultant meromorphic function is also rational. At the point infinity, the 
function then has at most an ordinary pole. In other words, a mero- 
morphic function whose singularity at infinity is an ordinary pole, is a 
rational function. Rational functions, then, are such having no other 
singularities except poles. Inasmuch as the representation of them is given 
by 

*(-) . ^ [.76] 

in which Pi and Po arc finite polynomials, it follows that a rational func- 
tion has a finite number of poles. 

14. Zeros and saddle points or points of stagnation 

If in the Taylor series representation for the function w = J{z) as given 
by Eq. 155 , the constant term Jq is zero but Ui is not zero, the function is 
said to have a simpk zero in the point z = Zq. For the immediate vicinity 
of this point, that is, for (s — So) < 1 , the function is approximately 
represented by the term ai{z — z^) alone. The reciprocal function, 
l//(z), has a simple pole in this same point, for its representation in this 
vicinity is approximately given by l/di(s — Co)- 

If both Oq and ai are zero and ao is different from zero, the function is 
said to have a zero of the second order in the point z = Zq. For the imme- 
diate vicinity of this point the function then is approximately represented 
by the term ^2(2 — ZqY. The reciprocal function similarly is approxi- 
mately represented by l/a2(s — ZqY and has a pole of second order in 
this point. 

In general a zero is said to be of the order 5 if the reciprocal function has 
a pole of order s in the given point. This is the case if the first nonzero 
coefficient in the Taylor series expansion for the function is a,. According 
to the formula 157 for the Taylor coefficients, this condition results if the 
fimction and its first s — I derivatives all vanish at the point z = so- 

It is also possible that the first 5 — 1 derivatives of the function are zero 
at the point 2 = Zo but that the function itself is not zero there. That is to 
say, all the coefficients ai, 02, and so forth up to and including are 
zero, but oo is not zero. In this case the function /(z) — ao, or w — Oq, 
has a zero of the order 5 in the point a = Zo> but the function w obviously 
is not zero there. Except for the additive constant Oo, the behavior of the 
function w in this vicinity is, however, clearly the same as though this 
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point were an 5th order zero. The terminology used to refer to such points 
must be so chosen, however, as to distinguish them from zeros. For reasons 
of physical interpretation, to be discussed in the following paragraphs, 
they are referred to as saddle points or also as points of stagnation* 

For the immediate vicinity of a saddle point of the order 5 — 1, one 
hast 


= w + > ^ ao -h a^z - ZoY [177] 

The point is a zero of order 5 if = 0. The following detailed discussion, 
in which the constant is dropped, applies to either zeros or saddle 
points. It is convenient to write 

{z - So) = [178] 

whence 

w - u jv ^ = a^r^icos s<t> +j sin s<l>) [179] 

Equating real and imaginary parts gives 

u ^ OsK* cos s<t> [180] 

V ^ dg/" sin s<f> [181] 

For the graphical representation, in the s-plane, of the loci for 
u = constant and v = constant, according to the discussion given in 
Art. 5, the relations 180 and 181 are helpful in showing the character of 
such loci in the vicinity of the point So- 
These loci are shown in Fig. 10 for the cases 5 = 1, 5 = 2, and 5 = 3. 
The corresponding sketches in Fig. 1 1 show how the algebraic signs of the 
quantities u and v change in the vicinity of the point s = sq. The pictures 
in Fig. 10 may be regarded as depicting the direction of the gradient 
{u = constant) and the lines of constant iltitude {v = constant) in a 
mountainous terrain. The picture for 5 = Z, for example, is then seen to 
represent the vicinity of a point which is simultaneously the top of a 
ridge and the bottom of a valley, that is, a mountain pass where a valley 
crosses a ridge. The terrain in such a region evidently has the shape of a 

•The German terminology (of which these are translations) is “ Sattelpunkt ” or 
“ Staupunkt.” Alternatively the term “ Kreuzungspunkt *’ is also used. 

tThe convention of designating the order of a saddle point as being 5 — 1 when a zero 
having the properties is referred to as being of the order ^ arises from the fact that the 
inverse function has a branch point where the given function has a saddle point, whereas the 
reciprocal function (not to be confused with the inverse function) has a pole where the given 
function has a zero. Just as the order of a pole receives the same designation as that of the 
zero of the corresponding reciprocal function, so the order of a saddle point receives the same 
designation as that of the branch point of the inverse function. These matters are discussed 
in greater detail at the end of Art. 18 after the method of dealing with multivalued functions 
is presented. 
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saddle. This fact accounts for the appropriateness of the term saddle 
point. 

For ^ = 3 the terrain has the shape of a saddle which might be designed 
for a three-legged person, or one may say that it is a region where three 
ridges and three valleys meet in a common point. 

Using a hydrodynamic analogy, one may regard the curves for 
u = constant as representing the direction of fluid flow, and the curves 
for V = constant as designating the orthogonal set of contours along 
which the gradient is zero. The fluid is streaming symmetrically toward 
and away from the point Zq. At this point the fluid is stagnant, thus 



Fig. 10. Loci of constant real and imaginary parts in the vicinity of saddle points 

of various orders. 

suggesting the term point of stagnation as an alternative designation. 

The pictures show, moreover, that if the order of a zero is greater than 
unity (or that of a saddle point is greater than zero), the orthogonality 
of the contours for u = constant and v = constant fails in the point 
z = 2o, since the respective curves there intersect at an angle of t/2s 
radians. This failure is, however, only apparent inasmuch as the contours 
actually do not pass through the point but are bent sharply at it. Fluid is 
deflected at the point Zq instead of flowing through it. 

From the standpoint of conformal mapping there is also an apparent 
failure in the angular relationships at a zero order greater than one (or at a 
saddle point of nonzero order). If, in addition to the polar representation 
of {z — Zq) given by Eq. 178, one also writes 

w - oo =* [182] 

in which Oq may or may not be zero, Eq. 177 shows that in the immediate 
vicinity of the point z-zq one has 

P = 


[183] 
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and 

e = S4> [184] 

Since (2 — 20 ) niay be regarded as a small path increment radiating 
from the point Zq in the z-plane, and w may similarly be regarded as the 



Fig. 11. Algebraic signs of real and imaginar>' parts in the vicinity of saddle 
points of various orders. 

conesponding small path increment radiating from the point Oq in the 
it^-plane, one observes, according to Eq. 184, that if the increment (z — zq) 
is rotated through an angle A</>, the corresponding increment w rotates, 
not through the same angle (as the conformality ordinarily requires), but 
through an angle s times as large. 

This apparent failure in the preservation of angular relationships is 
clarified by the recognition that the inverse function z = i/{w) is multi- 
valued and that the point z = Zo is a branch point of the order 5 — 1 for 
this inverse function. The discussion of conformal mapping in Art. 2 
points out that such a pair of maps in the w~ and z-planes is a graphical 
representation not only for the given function but also for the inverse of 
it. Consequently one must recognize that although the function given by 
Eq. 182 is single-valued, the multivalued character of the inverse function 
cannot be ignored. 
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Although the detailed properties of multivalued functions is left for 
discussion in Arts. 17 and 18, it is appropriate to point out here how one 
may see that the failure in the preservation of angular relationships is 
only an apparent one. If the point in the c-plane is enclosed by a 
sufficiently small circle c, the corresponding locus in the Tc-plane is very 
nearly a small circle c* surrounding the point (which corresponds to 
Zq). As a point traverses an arc on the circle c, the corresponding point 
on c' traverses an arc s times as great. In Fig. 12, a and b are two small 
radial line segments emanating from the circle c, and a' and 6' are the 
corresponding segments in the ^t‘-plane. The angle 6 between a' and b' is 5 




Fig. 12. Apparent failure in the preservation of angular relationships at a saddle 

point. 

times that between a and b. If the radii of the small circles c and r' arc 
allowed to become still smaller, the line segments appear to radiate from 
the points Zo and Wq, and one is led to conclude that the preservation of 
angular relationships has failed because one’s attention is focused upon 
the angles </> and 6 rather than upon the angles between the line segments 
and the circular arcs, which remain equal to 90 degrees. However, if one 
mentally visualizes the situation in the limit as though the small circles 
were still there, it becomes clear that what has happened is not a failure 
in the preservation of angular relationships but rather is the result of a 
peculiar and somewhat misleading behavior of the given function in the 
vicinity of the point z = Zq. This view is borne out by the fact that the 
Cauchy-Riemann equations, guaranteeing the uniqueness of the deriva- 
tive, still hold in this point. 

15 . The evaluation of contour integrals; Cauchy's residue 

THEOREM 

If the formula given by Eq. 166 for the coefficients of the Laurent 
expansion is written for the integer value n = —1, it reads 

/(z) dz = 


[ 185 ] 
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The contour S is in the present discussion assumed to enclose a region 
within which the function f(z) has but one singularity at the point Zq. 
This situation is indicated in Fig. 13. If the coefficient b^i can be deter- 
mined in some way (for example, as described subsequent!)'' in this 
article), Eq. 185 represents a means for evaluating the contour integral 
for the function f(z) extended around a 
given closed boundary 5. 

According to the Cauchy integral law, as 
discussed in Art. 6, the value of this contour 
integral is zero if the given function /(s) is 
regular at all points enclosed by the con- 
tour. The present result substantiates this 
fact, for if f{z) is regular also in the point So, 
the Laurent e.xpansion about this point has 
no principal part yii becomes identical with 
the Taylor expansion) and hence b^i = 0. 

The present result may be said to repre- 
sent a completion of the Cauchy integral law 
in the sense that it yields the value of the 
contour integral whether the function is regular within the enclosed re- 
gion or not, and hence it contains the integral law as a special case. 

An interesting alternative way of obtaining this same result is to begin 
by assuming that the contour integral is given and that an evaluation of 
it is sought. Since the singularity at Zo is the only one within the region, 
the contour 5 may be replaced by a circle C about Zq according to reason- 
ing similar to that used in Art. 7 in replacing the integral 72 by the 
integral 73. In other words, the path of integration may be deformed or 
contracted so long as no part of it is allowed to sweep over a singular 
point. Thus 

£fiz)dz = £f(z)dz [186] 

Now /(z) may be replaced by its Laurent exp)ansion which reads 

/(s)= i bniz-zor [187] 

n = — « 

Because of the uniform convergence of this series, the integration may be 
carried out term by term, giving 

£ /(z) dz= 

If the radius of the circle C about Zq is denoted by p, then 

{z - Zo) = 



Fig. 13. The integral of a 
fiinction about S is de- 
termined by the value of the 
residue of the pole at zo. 


[189] 
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and 

dz = jpe^* d<t> 

[190] 

Equation 188 becomes 



£f(z)dz 

„ - _ » Jo 

[191] 

But 



jrv— l:;:;:! 

[192] 

Hence 

£ f(z) dz = 

[193] 


which agrees with Eq. 185. 

The coefficient h^\ is called the residue of /(c) in the point c = co- 
The value of a contour integral enclosing a singularity is, therefore, 

equal to lirj times the residue of the func- 
tion in this singularity. This result is re- 
ferred to as Cauchy's residue theorem. 

WTien the contour 5 encloses more than 
one singularity, that is, if the function /(c) 
is singular at several points Ci, C 2 , • • • Zk 
within the enclosed region, as indicated in 
Fig. 14, the contour S may be replaces! 
by k separate contours each enclosing one 
of the singularities, and the value of the 
contour integral around 5 is seen to be 
given by the sum of individual contour 
integrals around the k separate contours. It becomes clear that in 
this case 

/ /(s) dz = + • • • + [194] 

in which • • • are the residues of f{z) in the points Cj, * * 

respectively. 

The contraction of the contour S to the several separate contours 
about Zi, Z 2 t • • • may be visualized through supposing 5 to be a rubber 
band which is shrunk in the manner indicated in Fig. 15. The contribu- 
tions coming to the net result from those portions of the shrunken contour 
which in the limit become superimposed and are traversed in opposite 
directions evidently cancel. 



Fig. 14. The integral of a 
function about S enclosing 
several poles is flctermined by 
the sum of the residues. 
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As a practical means for evaluating a given contour integral, of course, 
this method is useless unless some way is found for determining the 
residue, which is the coefficient of the first term in the principal part, 
Eq. 168, of the Laurent expansion. When several singularities are en- 
closed by the contour 5, the residues of the function j{z) are usually 
found separately for each of the singu- 
larities. If it is |X)ssible to find the first ^ plane 
term in the principal part of the Laurent 

expansion for the immediate vit inity sur- ^ 

rounding each singularity, this objective / ^ 

is accomplished. ^ 

The following method for evaluating ' 

the residue is useful in many cases. If 5 - 

the singularity of fiz) in ::o is an ordinary , c . -u f u 

pole, the reciprocal of this function 

\ ^ for separately by shrinking the 

0(c) = — T [195J contour. 


is regular in Co and may there be expanded in the Taylor series 
0 ( 2 ) = 0(Co) “f-0^(-()) * (-~'2o)+§0^ («o) * 

in which the primes denote differentiation with resp)ect to z. 

The detailed process now varies according to the order of the pole of 
f(z) in Co- If this jxfie is of the first order, 0(2:0) = 0 , but <f>'(,Zo) is not 
zero. Division of the series 196 into unity (by the ordinary process of 
long division) yields 

- /(») - 5 

11 (0")^ _ 1 ;^\ r, _ - \ . ri071 


1 <t>" , .-,0 

2 (♦')* *' ^ 




This is the Laurent expansion of /(c) about the point Zq, Hence the 
residue in this case is 

If the pole of/(s) in Zq is of the second order, «(zo'' =0 and <l>'{zo) = 0, 
but <t>”{zo) 9 ^ 0. Division of the series 196 into unity then reads 




, v-o 2 ^ „ .-X 

(z - 2 o) * - 3 '“>) 


12 («'")" 

+ t9W 


1 <t>"" 

6 {<t>'T 


iZ - 2 o)“ - 
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from which the residue is seen to be 



[ 200 ] 


WTien the pole is of higher order, the expression for the residue becomes 
increasingly more complicated, but the method of evaluation remains 
the same. 

The detailed aspects of the process of evaluating residues may, of 
course, be varied in a great many ways, and the most expeditious course 
depends entirely upon the form of the s|X'cilic function in hand. Addi- 
tional ways of approaching the pro!)lem may, in the course of the solution 
of specific examples, suggest still other variations. 

If f{z) has an 5 th order pole in c — Zn, then 

^(z) = (z-ZoYfiz) [201] 

is regular in this point. Hence it {x>sscsses the I'aylor expansion 

}p{z) = ^(zq) + (zq) • {z — Zq) -f — :;o)* 4 " • • • [ 202 ] 


Since 


m 


yPiZ) 

~{Z - ZoY 


[204] 


substitution of the Taylor series 202 for \l/(z) yields the Laurent expansion 
of f{z) about Zq. It is then clear that the residue of f{z) in Zq is given by 



Sometimes the function ypiz) in K(|. 201 is more conveniently e.xprcssed 
as the product of two simpler functions: 

Hz) = ^{z) . viz) [205] 

Suppose that 

iiz) = ao + ai(s — Zq) + a2iz — Zq)^ -p . . . [206] 

and 

viz) = + Pliz — Zq) + P 2 iz — Zq)^ -f * • • [207] 

are the Taylor expansions for these functions about s = ::o* Since the 
residue of fiz) in Zq is the coefficient of the term containing (2 — 
in the product v, it is seen that 

6-1 = + * • ' + [208] 

In numerous practical problems the function fiz) in the integral 185 
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has a finite number of orrlinary poles in the finite z-plane, all of which are 
enclosed by the l)oun(lary 5. At the point infinity, the function may or 
may not be singular. If the complex plane is thought of as replaced by the 
surface of the complex sphere, it is possible to regard the contour S either 
as one enclosing all the singularities of J{z) which occur for finite z- values 
or as one which encloses none of these singularities but merely surrounds 
the ix)int at infinity. In other words, the region which ordinarily is re- 
garded as being external to the contour S may alternatively be inter- 
preted as being the enclosed region. The latter is, of course, traversed in 
the opix)site sense, but for the moment this fact is of secondary impor- 
tance. If it happens that the function J{z) is regular at infinity, one is 
confronted with the peculiarity that the integral around a contour 
enclosing no singularities is nevertheless not equal to zero. The unique 
feature alx)ut this situation, however, is that the region in question 
contains the point at infinity. Hence one must conclude that the residue 
of a function in the ix)int at infinity is not necessarily zero if the function 
is regular there. In fact, the residue of f{z) at infinity is equal to the 
negative sum of its residues in all its singularities which occur for finite 
c- values. 

A simple example may illustrate this point more specifically. Suppose 
f{z) = 1/z. This function has a simple pole at z = 0 (with the residue 
unity) and is regular ever\'where else. At infinity the function has a 
simple zero. A circular path enclosing the origin may alternatively be 
regarded as a circular contour enclosing the point at infinity. If these 
paths are separately traversed in their counterclockwise directions, the 
values of the resulting integrals are dz 2^7 respectively. Notwithstanding 
the fact that /(z) is regular at infinity, it is seen that the contour integral 
. enclosing this ix)int has a nonzero value. 

Conversely, one cannot conclude that the i^Uegral has a nonzero value 
for a contour enclosing the jx^int at infinit} if the function there has a 
simple pole. For example, if /(z) = z, the contour integral evidently has 
the value zero. 

The residue of a function in the point at infinity cannot be evaluated 
by any of the processes which apply to finite points. One might suppose 
that such an evaluation could be accomplished through first introducing 
the change of variable indicated by the substitution f = 1/z, which inter- 
changes the origin with the point at infinity, and then proceeding in the 
normal fashion. Again the above example for the function /(z) = 1/z 
shows that this method is obviously incorrect. 

16. The partial fraction expansion of rational functions 

If /(s) is a rational function, then, as pointed out in Art. 13, it has a 
finite number of poles in the entire s-plane. This number, the pole at 
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infinity bein" excludcci if present, may be denoted l)y ;/» and the i:-values 
corresponding to the fX)les of /(:;) by Ci, Zj. • 'Fhe principal parts of 
Laurent expansions for the function aliout the {ku'iUs Zi, Zo. • • are 
denoted by //i, //o, • • • //„ respectively. I'he function 

f(z) - //i(:) - h..(z) - • ■ . - li„{z) = giz) [209] 


must be an entire rational function, that is, a iKilynomial in c whose 
highest power equals the order of the |K)Ie of /(c) at infinity. 'I'his fact is 
recognized through observing, for example, that /(c) — li\(z) must be 
regular at the point c = Ci because its Laurent expansion about Z\ has 
liO principal part and hence is a I'aylor exjiansion. However, the function 
/(c) — //i(c) still has poles at the |K)ints co, C;i, • * • c„. Next, the function 
/(c) — /ii(c) — lioiz) is seen to be regular at the i>oints Ci and Co, and 
therefore has only the poles at C;i. C4, • • • c„. and so forth. 

Transposing the principal parts in Kq. 209 to the right-hand side, one 
obtains the representation 

/(c) = //i(c) -f //2(c) + . . . -f //„(c) -f }i(z) [210] 


in which each term places one of the |>oles of /(c) in evidence. It is an 
explicit representation of the function in the form of a linear superjxisition 
of the individual contributions of its singularities, d'his representation 
for the function /(^c) is known as its partial fraction expansion. 

More specifically, if 


piz) ^ Oro 4- rtjC 4“ « 2 C“ -f- . • • 4- 
q{z) fio 4- /ii- 4“ doC" -f- • • • -f 


[ 211 ] 


and it is assumed that all the roots of (/(c) are di.'itinct, all the jxiles of /(c ) 
are simple. These roots may be denoted by Ci, c^, • • c„. The principal 

parts of /(c) in its poles then have the form 

h 

//.(c) = f-f^ [212] 


in which are the corresjxmding residues. 

Applying Eq. 198 of the previous article, and noting that according 
to Eq. 195 <p{z) is in this example <l> = g p, one finds 





[213] 


The derivative of q(z) for s = c„ may be further evaluated through noting 
that the factored form of this polynomial reads 


q^z) = 0„(z - z,)(z - Z 2 ) • • • (z - z„) 


[214] 
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and hence that 



= /3„(S, - 3i)(3, - Zi) - ■ ■ (Z, - 2,_i)(3, — Z^i) • • • (Z, - Z„) 


[215] 


which may altcniatively be written 





[216] 


The way in which Eq. 215 is arrived at may readily be seen through 
first regarding q{z) in Eq. 214 as being in the form 


qiz) = (Z - Z.) ■ q*{z) 


l217] 


in which q*(z) is the ri^ht-haiKl side of Fci. 214 with the factor (s — s„) 
missing. Xow applying the rule for the derivative of a product to ^( 2 ) as 
represented in Kq. 217, one linds that 


drj 

dz 


= (- 


=.)f +,•(=) 


[218] 


and hence that 


which agrees with Kqs. 215 and 216. 

In view of these considerations it becomes clear that the residues as 
given by Eq. 213 may alternatively be written in the form 

= [fz -=.)•/(=)].=.. [220] 

These results are unchanged if the ix)lynomial ^(z) has a zero root, that 
is, if 0a = 0. They are restricted to the case jf simple poles, of course, 
since the roots of q{z') arc, in the above anal .sis, assumed to be distinct. 
\Yhcn multiple roots occur, methods similar to those discussed in the 
previous article for evaluating residues at multiple-order poles must be 
used to tletcrmine the principal parts ltw{z) in Eq. 210. 

If all the poles of/(z) arc assumed to be simple, the results stated by 
Eqs. 212 and 220 determine the partial fraction e.xpansion 210 except for 
the rational integral function gfz). This function is found from the form 
of fiz) given by Eq. 211 in the following way. If m ^ n, then /(z) is 
regular at infinity, and the entire rational function g(z) reduces to a 
constant. More specifically, if tn = then 



whereas if m < », then g(s) is identically zero. 


[ 221 ] 
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On the other hand if w > w and m — ;/ = 5 , then f{z) has an 5th order 
pole at infinity. The function g{z) is found through dividing g{z) into 
p{z) by long division so as to get 

+ • • • + 7iS + 70 + t222] 

9 (=) 9 (=) 

in which the remainder polynomial p*(z) has the degree w — 1. Then 

gi=) = 7o + 7i2 + • • • + ys^" [22v?] 

For m = n this process yields = 70 = whereas for m < n 

it is clear that = 0 as stated above. 

These considerations amount to putting/!::) into the form 

/(s) = g(-) + -!-/*(-) [224] 

in which/*!::) has a simple zero at infinity but contains the same p)les 
as/(s) for the rest of the c-plane. In these |X)les /*(::) has the same prin- 
cipal parts hy{z) as the function f{z), and these f)rincipal parts are found 
according to Eqs. 212, 213, and 220 by u.se of either p{z) and /(c) or 
p*{z) and /* ( 2 ) in these expressions, whichever appear to be more ex- 
pedient. 

17. Multivalued functions; branch points and Riemann 

SURFACES 

A multivalued function with which the reader undoubtedly has some 
acquaintance is the logarithm.* This function is defined by the integral 

in which the path of integration in the ^plane extends from the point 
f = 1 to the point f = 2 , but otherwise remains arbitrar>\ The integrand 
is the function 

/(f) = J [226] 

which is regular everywhere except at the origin (f = 0) where it has a 
simple pole with the residue unity. 

Hence the value of the integral is not affected by a deformation of the 
path of integration so long as no portion of this path is allowed to sweep 

•Unless otherwise specified it is undentood that the natural logarithm is implied. This is 
denoted by In z to distinguish it from the Briggs logarithm, which is written log z. 
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across the origin. With reference to Fig. 16, one has 


Ji\ f J/>2 f Jl f 


[227] 


in which the subscript 1 on the closed contour integral indicates that the 
origin is encircled once in the counterclockwise direction. From Art. 15 
it is seen that therefore 



[228] 



Fig. 16. The integral on elose<l 
path A /*• yields 2x; since the 
branch j>oint at the origin is en- 
circled once in counterchx kwise 
s<'nsi‘. 



Fig. 17. The integral on closed 
path P\~Pi again yields 1-nj. Path 
is different from that in Fig. 16 
but sense around branch p>oint is 
the same. 


The same result is true for the two alternative paths shown in Fig. 17. 
In. general, if the path Pi encircles the origin in the counterclockwise 
direction n more times than the path Po does, 



Hence if in conjunction wdth the definition 225 no statement is made 
relative to the path of integration, the value of the logarithm is deter- 
mined only within an arbitrary integer number of Its multivalued- 
ncss is thus apparent. 

This result is also readily obtained from the more familiar definition of 
the logarithm, according to which 

2 = c*"* [230] 

If In 2 is here replaced by In a + Irmj, the value of the exponential 
differs only by the factor which has the value unity. The view 
stated in the preceding paragraph, however, gives some pictorial signifi- 
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cance to this multivalucdness, whereas the more elementary reasoning: 
precludes the possibility of such an interpretation. 

For further elucidation, the following 



more detailed representation is helpful. The 
complex number z is written in the jx)lar 
form 

z = [231] 

As shown in Fig. 18, the path of integration 
is assumed to consist of the |)ortion Li coin- 
cident with the real axis from 1 to r, followed 
by the portion Lo, which is an arc drawn 
from the point r on the real axis to c. 

The variable of integration is also written 


Fig. 18. Integral of 1 c along 

in the polar form 


Li contributes to real part of 


r - 

logarithm: that along L^ con- 

whence 

tributes to imaginar\' part. 

f.J, + 

dp 


II 


[232] 

[233] 


or 


df = dp = (dp -f jpdd)e’ ^ 


f = [235] 

r p 

The integral 225, separated into two parts corresix)nding respectively to 
the |X)rtions Li and L2 of the resultant path of integration, reads 

[ 236 ] 


is the logarithm of the magnitude of z, wherear 


X de = 4> 


is the angle of z. Hence 

In z = In r + [239] 

The multivaluedness of the logarithm is seen to result from the addition 
or subtraction of an integer number of complete revolutions to the path 
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/.). This operation simply adds positive or negative integer multiples 
of 2ir to the upper limit of the integral 238 and hence to 0 appearing in 
Kq. 239. Written in more complete form, this relation, therefore, reads 

In c = In r + y0 + jl'Kn [^40] 

in which n is any positive or negative integer. 

h'or the detailed discussion of the multivalued character of the loga- 
rithm it is expedient to detine the value of Kq. 240 for n = 0, namely, 

Ln c = In r + [241] 

as the principal value of In c, and write 

In c = Ln n + jlirn [242] 

Ordinarily in speaking of the logarithm one has in mind the principal 
value only. 

Now that the nniltivaluefi character of the logarithm and the reasons 
for it are established, attention may be given to the question of how the 
ambiguity can be taken care of when this function enters into some 
problem, for example, in the consideration of the mapping of n* = In z 
in the u- and c-planes. I'o a given point in the c-plane there correspond 
an^inlinite number of iK)ints in the ^i’-plane, all of which have the same 
real part whereas their imaginary parts differ by multiples of 27r. 

Such a set of |K)ints in the ic-j)lane is pre.sented in Fig. 19, which shows 
that the entire ::-plane is mapi)ed in any one of the oppositely cross- 
hatched strips, 2w units wide. Horizontal lines in the tc-plane correspond 
to ratlial lines (0 = constant ) in the >plane; vertical lines in the Ti’-plane 
corres[X)nd to concentric circles about the origin (r = constant) in the 
c-pkine. I'he system of concentric circles and radial lines in the c-plane 
is, therefore, transformed into a rectangular grid in the ic-plane. 

The fact that the locus joining a set of poii.is in the xe-plane corre- 
sponding to the same c-value (like the vertical dotted line in Fig. 19) 
becomes a circular locus in the c-plane which winds around and around 
the origin suggests that the multivaluedness may be eliminated artificially 
though conceiving the c-plane in the form of a winding surface comprising 
an inlinite number of superimiK)sed leaves which have a common central 
point at the origin and simulate a winding stairciise of infinite width in 
which the steps are replaced by a smooth continuous ramp. As the slope 
of this winding surface is made smaller and smaller, the spacings between 
the Icax'es (or successive elevations of the ramp) ultimately become 
negligibly small. 

The one original c-value representing an infinite number of Xt»-values 
differing hyjlrPt is now separated into a uniquely corresponding infinite 
number of s-values which lie directly above and below each other on the 
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vanous stages of the ramp or levels oi the aMitiniious winding surface. 
Each level, or leaf, of the surface carries one of the separate 2 - values 
which now corresponds uniquely to one of the 24’- values. Vo go from the 
value representing a given ic- value to that representing the value 
w it JItt, one must follow the ramp once around the origin in the counter- 
clockwise or clockwise direction respectively. The winding sense of the 


ramp is such tliat the increment i: 



Fig. 19 . Sections of the ic-j)lane each 
of which corresponds to a h af of the 
Riemann surface of In 3 in the 3-plane. 

it is impossible to wind about the 


2 t’ is TyiTT for one revolution in the 
coil n t e rclock w i .sc ( 1 i rec t ion . 

This hyi)otlietical surface which 
ihiis effect i\ely renders the func- 
tion single-N'aliied is a Riemann 
surface. Fhe origin or common 
|H)int about which the surface 
winds is t ailed the icindin}; point ov 
branch point of the Riemann sur- 
tace. It evidently is a singularity, 
for the logarithm function does not 
have a finite value there.* With 
the e.xception of this j)oint and the 
one at infinity, the function u’ - \nz 
is regular in the entire Riemann 
surface, and its integral around 
any clo.sed contour is /xro becau.se 
•rigin in the same sense a nonzero 


number of times and return to a given starting point. 

The logarithm is an illustration of a multivalued transcendental 
function. Ihe Riemann surface for such a function has an inlinite number 
of Iea\ es because the multivaluedness is inlinite. 1 he inverse trigonomet- 
ric functions tan~^ 3, sin“‘ 3, cos'* 3, etc., are other e.xamples. 

A class of multivalued functions having a linite degree of multi- 
valuedness, and hence possessing Riemann surfaces with a linite number 
of leaves, is found in the algebraic functions.! A simple example is the 
function 



In \iew of the preceding discu.ssion it is convenient to write this expression 
in the form 


w = e 


1.2 Ln U— zo)-|->Tn 


L244J 


•It should not be inferred that branch joints necessarily arc singularities of this kind. It is 
po^ible (as in the case of many algebraic func lions) for the function to have a finite and * 
unique value at a branch [xjint. 


fNot all algebraic functions are necessarily multivalued. The rational functions for 
example, are single-valued algebraic functions. ’ 
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For a f^iven value of s, this function has two values, one for » = 0 and 
the other for ;?==!. For « = 2 the same value again obtains as for n = 0, 
and for n = 3 the (n = l)-value is repeated, etc. The Riemann surface 
for this function, therefore, has but two leaves. The branch p)oint is 
located at z — Zi). 

'I'lie structure of the Riemann surface about c = Sq is similar to that 
of the logarithm function about its branch point, but the complete 
surfat e must now be so constructed that the same leaf is regained after 
two complete revolutions about the branch [X)int. The first step toward 
clarifying this picture involves a recognition of the fact that the p>oint at 
infinity has the same character as the |K)int Co- This circumstance is 
s^imewhat easier to see if the variable is changed by the transformation 



Then F2qs. 24v3 and 244 become 

u’ = ( 2 ')-'- = [246] 

in which tlie i>oint z' = 0 corres{X)nds to :: = x. 

Since the txiint at infinity must be included in the present \’i5ualization 
process, it may be a little easier if the c-plane is replaced by its associated 
complex sphere, llus sphere is imagined to have a double surface. Both 
surfai es are slit from the |K)int :: = wo to the ix>int at infinity along a path 
which is arbitrary except that it shall have no crossover points. Through- 
out the entire length of this cut, the top leaves of the double surface are 
now imagined to be joined to the bottom leaves on the opposite sides of 
the cut (a physical im|x)ssibility, of course, but not beyond the powers 
of a g<K)d imagination) so that crossing the cut in either direction effects 
a transfer from the upiKT to the lower leaf of the Riemann surface or 
vice versa. 'Fliis continuous duplex ramp is calle 1 a branch cut. 

The resulting duplex surface evidently has the properties required for 
the uniciue map[)ing of the function given by Eq. 243. The two c- values 
corresixinding to the two tc-values that differ by a factor lie above 
one another on the two leaves of the Riemann surface, and to go from 
one of these c-values to the other it is necessary to cross the branch cut 
and pass completely around either the branch point at = Sq or that at 
c = X . I'he same i:- value may be regained by passage around a branch 
point an even number of times. 

The function 

■w = V {z - 3i)(3 - S 2 ) • • • (s - Zfc) [247] 


is also double- valued, and its Riemann surface again has two leaves. 
The branch jwints arc Si, Sj, • • • s*. For very large values of s the function 
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behaves like z^' and hence it is clear that the p)<)int at infinity is also a 
branch point if k is odd. 

The branch cuts of the Riemann surface are made aIonj< paths joininj^ 
the points ::i and :: 2 , z-^ and etc. These paths cannot have any crossover 
p)oints. In particular, for A’ = 2 the branch |K)ints are z = Zi and z — Zj. 
which are joined by a branch cut in a manner exactly analo^^ous to that 
discussed for the ixnnts c = and z = m the precedini; example. 
In fact, the two-leaved Riemann surface for the tunction 247 with 
A: = 2 is in all respects similar to that for the function 244 except that the 
second branch jxunt lies at a finite c-value. A clo.sed path whicli encircles 
both branch noints Z\ and c- remains on the same leat ot the Riemann 

surface; that is, a ^dven yxnni z on 
this clo.sed path is reLcained after a 
sini^le traversal. 

This situation may be clarified by 
reference to Idi:. 20. A traversal en- 
closini^ both }K)ints :;i ami z-* which 
begins and ends at P must i)e equiva- 
lent to a traversal around the closed 
j)ath Lx followed by a traversal aiound 
the clo.sed path Lj, becau.se the differ- 
eme amounts only to the traversal 
from P to {> and back to P, around the 
shaded area which (ontains no branch 
|)oint. Inasmuch as a .separate traver- 
sal around Li or around L> effects a transition from one leaf of the 
Riemann surface to the other, it is clear that a given jK)int traversing 
these two circuits in succession mu.'>t return to the same leaf of the 
Riemann surface. 



Fig. 20. A point travrr.sini: L-, an<l 
Z 2 in succession must return to the 
same leaf of the Riemann surface. 


Analogous reasoning shows that the cases k = 4 and A’ = 4 have 
similar Riemann surfaces. For A^ = 4 the finite point z.^ replaces the 
branch point which for ^ = 3 occurs at infinity. 4'he surface has two 
branch cuts, and a clo.sed circuit which surrounds any two branch jx)ints 
returns to a given point after one traversal. 4'he extension of this rea.son- 
ing to the interpretation of the two-leaved Riemann surfaces for larger 
values of k is straightforward. 

An extension of the same reasoning likewise leads to the required 
structure of the Riemann surface for the function 


m. 


— \ Z — Zii 

[ 248 ] 

- i.ii z«)-fy 

VI 

[ 249 ] 


This function is w-valued, has a branch point of the im - l)th order 
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at the {X)int 2 = So, and possesses a Riemann surface ha\dng m leaves. 
The fx)int at infinity again has the same character as the point 2 = 2 o, 
and these two jx)ints are joined by a slightly more complicated branch 
cut. At this branch cut the top leaf of the Riemann surface is joined to 
the one located below it on the opj)osite side of the cut, whereas the 
second leaf on the original side is joined to the third leaf from the top on 
the op|)osite side, etc. Finally, the bottom-most leaf on the original side 
of the cut is joined to the top leaf on the oj)j)osite side. The result is that 
a jKith must encircle either branch |X)int m times before a givf n point 
on one of the leaves can be regained. 

It may be useful to observe that the function = \ 2 has a Riemann 
surface entirely similar in its structure to that of the function «:■ = In 2 
e.xcept that the latter has an infinite number of leaves. The branch cut 

ri — 

in the case of the furu tMin = \ 2 is a necessary concept onh' because 
some mec hanism must be imagined whereby the bottom leaf rejoins the 
top leaf of the surface. Actually leaving this mechanism entirely to the 
imagination is more effective than trying to formulate some sort of 
piecing and pasting process between the leaves which must afterward 
be aj)ologized for liecause of the mechanical imix)ssibility of carrying out 
such a scheme in a physical model. 

A mathematician prefers not to be annoyed with the physical difficulties 
involved in the visualization of a branch cut, the more so since it implies 
the e.xistence of a definite jiath along which the passage from one leaf to 
another takes place. .Vccording to the true conception of a Riemann 
surface, such passage is not to be regarded as localized in a branch cut. 
Rather, one is to regard the invention of a branch cut as made necessary^ 
only by reason of the inadeijuacy of one s habitual conception of space to 
comprehend the mechanism of the Riemann surface. 

In the function a- = In 2 this difficulty doe- not appear, but it is 
replaced by the ecpially difficult conception (n an infinite number of 
leaves in the Riemann surface. In terms of the complex sphere associated 
with the 2-plane, the south jxde (origin) and the north pt)le (infinity) are 
branch jx)ints of infinite order. The Riemann surface is a continuous 
succession of spherical shells winding round and round a polar axis, 
somewhat on the order of a snails shell only that the winding pitch is 
zero and the number of revolutions infinite. The surface for the function 
U' = \ 2 has the same structure but comprises only m revolutions, after 
which identity with the original leaf is again established. 

18 . Algebraic functions; more about the classification of 

FUNCTIONS 

From the discussion just given it is clear that if the function w — j{z) 
has a branch point of the {m — l)th order in 2 = 2 o, then, by use of the 
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substitution 

II 

1 

[ 250 ] 

one may study the function 

= /(2o + 

[ 251 ] 


for an appropriate value of r on the various leaves of its Riemann surface 
through letting 0 vary continuously from some initial value 0o to a final 
value 00 + 2 m 7 r. A further increase in 0 will merely yield the same values 
over again. 

The ?>i leaves of the Riemann surface for this vicinity of the ix)int 
;:o are encountered in cyclic order in the intervals 

00 2 iv — Ott <! 0 <C 00 2 t' 7 r [252j 

with V = 1, 2, • •• m. 

The ;w-value5 of the multivalued function fiz) may corres|X)ndingly 
be denoted by 

V, = /(So - ) I 25 i] 

with 00 <0 < 00 -H 27 r. 

The quantities :ci, icj. • • which represent the function on the 

various leaves of its Riimann surface, are called the hrufu/irs of the 
function /(:;). They evidently f<^rm a cyclic grouj). since, for the interval 
00 + 27 r < 0 < 00 -f 47r, for example, ic, replaces ;cj. replaces u-3, 
and so forth, and replaces xci. 

In terms of these branches of the function ic = f^z), an imjK^rtant 
property of the branch fxiint of finite order may now be stated, namely, 
that the same limiting value of the function results for the limit c — > 
regardless of which one of the branches xcj, ■ etc., is cho.sen in the 
process of evaluating this limit. In other words, the value of /(c) in the 
branch point may be approached from any one of the vt lea\'es of the 
Riemann surface. Since the branch ]X)int thus yields a uniejue value for 
the function, it may be included as a jK)int of this surface. 

It is possible for a multivalued function to have this character in the 
vicinity of a point c = Co and there fX)ssess an infinite or finite value, 
either of which is definitely determinable. A singularity of this sort has a 
more general character than that of either a branch iK)int or an ordinary 
pole, since it embraces both these as special cases. It is called an ul^rbraic 
singularity inasmuch as it represents the only kind of singularity found 
in that class of functions known as the algebraic Junctions, 

In order to study the behavior of an w-valued fum tion in the vicinity 
of an algebraic singularity at the point 2 = s;,,, one may make the change 
of variable indicated by 


/ = (C ~ 


[ 254 ] 
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'Fhen 

=/(2) = ^(/) [255] 

becomes a single-valued function of the complex variable /, since the 
latter has taken over the ;?z-valued cliaracter in terms of the original 
variable c. If the function ^(/) possesses an «th order pole at the point 
/ == 0 (corresf)onding to = ^o) then, for the vicinity of this point, it 
may l)e represented by a Laurent expansion whose descending part 
contains ;/ terms. Hence it becomes clear that the function /(z) admits 
the following expansion in the vicinity of an algebraic singularity : 

/r-) = £ C(z ~ [256] 

y = — ri 

This singularity is an ordinary iK)le of the ;/th order for the function /(z) 
if ;;/=!. It represents an ordinaiy' branch i>oint of the (m — l)th order 
if ;/ = 0. 

In general, that is for > 1 and « > 0, the function /(c) is said to 
|K)>.sess a singularity at the fx^int c = Zo which is simultaneously a pole 
of the order n and a branch point of the order ;?/ — !. For example, the 
function /(c) = \ c has a branch i)oint of order 1 at c = 0, whereas at 
c = X it has simultaneously a simple {X)le and a branch |X)int of order 1. 
The Siune statement, with an interchange of reference to the points c = 0 
and c = X , applies to the function/(c) = 1/ \ c. If ;/ is infinite, the func- 
tion has an essential singularity at the branch point of order m - 1. On 
the other hand, if m is infinite, the function is said to have a logarithmic 
hramli point (it is then no longer an algebraic function, for the latter can 
have branch jx)ints of finite order only). 

-Vgebraic functions are defined as functions possessing only a finite 
number of algebraic singularities. If the function is w-valued, it is not 
necessary that all or that even one of its singif irities also be a branch 
jx)int of the order {m — 1), but a suHicient number of these singularities 
must be branch ix)ints of such order and distribution as will insure that 
the m leaves of the associated Riemann surface form a connected system. 
For any nonsingular c-value the function /(c) jx^ssesses the m-values 
indicated by 

/(c) = tt’i, U'oy ’ ' ' ‘i’m [257] 

which represent the function on the correspondingly numbered leaves of 
its Riemann surface. In terms of these branches of the function /(c) one 
may form the system of symmetrical functions 

(-) = + 4^2 “b * * * 

ypoiz) == WiWo + WiW^ + • * • + Wtn^iU\n 

4 'miz) = W1W2 '-Wtn 


[ 258 ] 
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In view of the fact that the branches Tcs. • • • form a cyclic i^nnip, 
it follows that all the symmetrical functions \h, • • ■ \pm are sinj^le- 

valued. That is. as the j)oint :: is allowed to move aloni; any path on the 
Riemann surface, avoiding only the sini^ularities ot f(z), the functions 
zcj, xco, • • ' can merely interchanjxe their identities, and since the 
functions ^^e given by symmetrical combinations of the 

elements :ci, ico, • • • they are not affected by such interchanges. 
Furthermore, they can have only algebraic singularities because this is 
the only kind that the branches ;ci, ico, • • • ic*,,, ]>ossess, and the ^ s are 
formed from the ic’s by the processes of addition and multiplication alone. 
^AJgobraic singularities in tlie case of single-valued tuiu tions, however, 
must be ordinary poles. Hence the functit)ns \p\iz\ must 

be rational. 

Now it is to be recalled from the theory of algel)raic ec[uations that the 
symmetrical functions defined by Juj. 25S satisfy the eijuation 

(:c - -:..)••• cc - 

= ic-'" - (z) • -h . . . ( - I = 0 [259] 

It is also recognized that the functions 258 havt‘ a common denominator 
equal to the product of the denominator'^ of tlu* branches u-j, 
and that this common denominator must be a rational entire function 
of from the bust of Ibjs. 258. or alternatt'ly because the bram lus ici. 

individually are single-valued on their re>jK‘ctive leaves of the 
Riemann surface and there |X)ssess .singularities which, therefore, can be 
none other than ordinary {x)les. Multiplying Kcj. 259 by this rational 
entire function, which is a finite ixdynomial pi,^z \ one finds 

FizAv) = • Xi'"* + pxiz) • xc"*-' f • • • + pjz) = 0 [260] 

in which />o, p\, ■ pm are finite |X)lynomials in c. 

The algebraic functions xx’ = fiz ) arc thus seen to !)e dcMined as the roots 
of an algebraic equation whose cocflicients are ordinary jxilynomials in 
More precisely, the roots are the branches which collectively define a 
single algebraic function on the various leaves of its Riemann surface. 
The function Fiz.'iv) may never be reducible to the product of two or more 
factors having the same form as Fiz.H'), since the vani.shing of any factor 
alone would then satisfy Eq. 260, and hence several indeiK‘ndent functions 
rather than a single one would be defined by this equation. 

The branch points occur for those values of for which Eq. 260 has 
coincident X£»-roots. The coincidence of roots requires that the discriminant 
be zero; and since this discriminant is a rational function of the poly- 
nomials poy Ph ' ' ' pm and hence a rational function of c, it cannot vanish 
identically, but can do so only for a finite number of s-valucs. Hence 
the number of branch points of the algebraic function defined by Eq. 260 
is finite. 
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The eiiKinecrinK student will find it hcli)ful to visualize the Riemann 
surfac e as a set of m parallel metal sheets and the branch points as spot 
welds which join two or more of the sheets at isolated points. The dis- 
tribution of these spot welds and the number of sheets held togethe’* by 
each must evidently be such that the w sheets are connected; otherwise 
instead of a single w-valued function, several functions of lesser order 
than m are defined. 

'Fhe classification of functions in terms of the nature and distribution 
of their singularities, which is partially discussed in Art. 13, may now be 
viewed in a more thorough fashion. All analytic functions may be divided 
into two main classc's, which are the algebraic and the transcendental. 
In other words, any function which is not algebraic belongs to the 
transcendental group, comprising single- or multivalued functions ha\dng 
essential singularities. .Algebraic functions, on the other hand, may be 
further subdivided into single-valued and multivalued functions. The 
single-valued ones are identified with the rational functions, which 
include some c)f the entire functions, namely, the finite polynomials. 
'File meromc^rj)hic functions, which are single-valued and may have any 
kind of .singularity at infinity but must have only poles in the finite 
c-plane, include some of the functions in the transcendental group and 
all the functions in the rational group. The entire functions may likewise 
be regarded as a subclass in the transcendental group, but in addition 
they lay claim to .some of the functions in the rational group. The follow- 
ing block diagram is intended to unify these remarks. 



It is now possible to discuss somew'hat more adequately the inversion 
of functions (which is introduced in Art. 3) with particular reference to 
the vicinity of saddle points described in Art. 14. Suppose a given single 
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valued function Xt* = /(c) has, at a jx)int z = Co, vanishing first and higher 
derivatives up to but not including the ;/th. hor the vicinity of tliis [X)int 
it then possesses the Taylor expansion 

+ < z ,. (= - 2 o )” + "-,+1 (2 - 2 o )"+‘ + • • • [ 261 ] 

with dn 9^ 0. Writing this equation in the form 

"± 11 ^ = (s _ -„)" + £=±1 (; _ Co)"+l + . . . [262] 

dn 

extracting the wth root , and introducing for abbre\iation the variable 


one may put the series 262 into the form 

- (C - Co) -f (c - Co)“ d I 


= (s - Co) 


1/n 


1 -f 




[264] 


The bracketed expressk)n. inclusive of the exponent 1 may for the 
vicinity of the point c = Co (which is not a braiuii i>oint) l)e expanded 
in a Taylor series ha', ing the form 

1 -{- /^i(c — c<i ) h-^iz — C())“ 4- • . • [265] 

This Taylor series represents the bracket fum tion in Kq. 264 on only 
one of the leaves of its Riemann surface. The corresponding representa- 
tions on the remaining n — 1 leaves of tliis surfac e. liowever, cliiYer from 
265 only by the factor 

= 1, 2, • • • ;z - I [266] 

Substituting 265 for the bracket expression in Eq. 264 yields 

T = (c — Co) -f- h\{z — Co)" + h-yiz — Co)’* -h • • [267] 

which, on the particular leaf of the Riemann surface in question, is a 
unique representation for t(c) in the vicinity of the jx)int c = c,,. Hence, 
according to the discussion in Art. 3, it jH)ssesses an inversc‘ function 
whose Taylor series for the vicinity of r = 0 (wliich is a regular j)oint) 
has the form 

(s - Zo) = /3 it -f- -f 03 7^ + • • • [268] 

The coefficients in this series for the inverse function may, for example, 
be found through first substituting the series for (c - Co) into Kq. 267 
and, after arrangement in ascending powers of t, obtaining 

r - fiiT + + (2bi0i02 + Ml + fti)r^ + • • • [269] 
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whence, equating coefficients of like powers of t, one has 

= 1 , 

[270] 

= 26 i 2 - 62 


Substitution of the expression 263 for t into Eq. 268 then gives 

= - - + c^)'"+ 


wliich is a representation for the inverse function c = <i>{w) for the vicinity 
of the {K)int ic = u’o, corresponding to the point s = zq. Equation 271 
may be written 

z ~ 0(xco) -f- C\(lV — tt'o)* “ -f C 2 {iV — -(-•*• [272] 

This, however, is recognized (according to the discussion leading to Eq. 
2vS6) as the expansion of an //-valued function in the vicinity of a branch 
[K)int of the (;/ — 1 )th order at the point = li'o- Hence it is established 
that if a given function ic = fiz) has a saddle point of the (n — l)th 
order at a jx/int c = Co, the inverse function c = 0(zt’) has a branch point 
of the (// — 1 )th order in the corresponding point xi' = u^q. If the point 
z = Co is a zero of the ;/th order for the function w =/(2), the above 
analysis remains unaltered except that icq becomes zero. The inverse 
function c = 0(tc) then has a branch {X)int of the {n — l)th order at the 
origin in the xx-plane instead of at the {X)int w = a’o- 


19. A THKOREM RKGARDIXG THE NUMBER OF ZEROS AND POLES 
WITHIN A (.IVKN REGION; THE FUNDAMENTAL LAW OF ALGEBRA 


A given function xc = /(c) is assumed to have a zero of the order a 
in the point c = Co- The Faylor expansion about this point then reads 

/(2) = - Go)“ + «a+l(= - So)“+‘ + • • • [273] 


and the derivative of f{z) is given by 


f'(z) = adafs — 2o)“~' + (a + l)aa+l(2 — 2o)“ + ' ' ' [274] 


Dividing 

/( 2 ) 


the series 273 into the series 274 by long division gives 
= a(z — So)”' + Co + Ci(5 — Zo) + ^ 2(2 — ZoY + • • • 


[275] 


This function, therefore, is seen to have a pole of the first order in the 
pwint z = 2o with the residue o. 

Alternatively, let it be supposed that the given function /(z) has a pole 
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in 2 * of the order /d. Then its Laurent expansion about this jK)int is given 
by 

f{z) = b_^{z - z*)-^ + /)_4+i (= - + • • • [27b] 

and the derivative reads 

f\z-) = -0b_i{z - - (/J - l)t- 3 +i (2 - + • • • [277] 

By long dixision it is then found that 

= -/3(= - =*)-■ + </„ ^ <f,(s - =*) + d-Az - s*)2 + . . • [278] 

from which this function is seen to have a simple pde in the point z - z* 
with the residue —3. 

.According to the residue theorem it follow's from tliese considerations 
that 

£/-^dz = 2irj{a-p) [279] 

where the path C is assumed to enclose no other zeros or {x>les of the 
function /(c ) except those at the iK)ints Co and z*. 

In general it is seen that if the path (’ encIosc*s zen)s of f{z) having 
the orders ai, jxdes having the orders ^tnd if 

ai + ao + • • • + Oik = .V [ 280 ] 

and 

Pi + ^2 + --+08== P [ 281 ] 

then 

The integer A" is equal to the total number of enclosefi zeros of the func- 
tion /(s), each one being counted as often as its order recjuires; the integer 
P is equal to the total number of enclosed fx)les of the fiim tion f[z), 
each one likewise being counted as often as its order recjuires. 'I'he result 
given by Eq. 282 states that the contour integral fonned for the ratio 
/'//, when multiplied by l/lirj, is equal to the total number of enclosed 
zeros of the function f(z) diminished by the total number of enclosed 
poles, each of these being counted as often as its order requires. 

An application of this result to a special case is of partic ular interest. 
Here the function f(z) is assumed to l)e the finite iX)lynomial 

f{z) = + • • • + ais + flo 


[283] 
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rhcn 


/ (^ ) __ ^ -j- (;z — ‘ A- dj 

f(z) c'* + * + ••• + (iiZ -f- ^0 


[284] 


Dividing the denominator into the numerator by long division, one obtains 
a series of the form 


f(z) n bo F, 

- — - = - 4 - -- 4 - 4 - . . . 

/(=) = ^ ^ r‘ ^ 


[285] 


which converges only for c-values lying outside a circle enclosing all the 
zero.s of the tunction Jiz). In the region outside this circle it represents the 
Laurent expansion for the function f [z) f[z), whence the integer n 
is recognized to be ec|ual to the sum of the residues of this function in 
its jK^les, all of which are located within the circle. Consequently, if C 
denotes this circular contour, then 


lirj Jr f{z) 


[286] 


Inasmuch as the function /!.:), given by Eq. 283, has no poles inside the 
contour C \ the conclusion follows that the highest pow’er n of the poly- 
nomial 2S3 etjuals the total number of roots of the algebraic equation 
/(c) - 0. I'his is the familiar /////</(iwc;//ti/ /tize of algebra. 


20. A MKTHOn FOR THK DETKCTIOX OF ZEROS WITHIN A GIVEN 
RE(;iON 


Another useful application of the result stated by Eq. 282 in Art. 19 
is the following. If a given function zc = f{z) is known to have only zeros 
witliin a region enclosed by the contour thei’ 


I 

2t/ 



= A 


[287] 


equals the number of enclosed zeros. .According to the principles of con- 
formal majrping, the contour C in the c-plane determines an equivalent 
contour in the tc-jdane. I'he integral 287 may correspondingly be replaced 
by 


-Lf 

IrjJD 


dxi' 


= N 


[288] 


in which Z) is a closed contour in the tt'-plane corresponding to the path 
C in the :-plane. From the discussion of the logarithm function in Art. 
17, it is recognized that the integer .Y in Eq. 288 must equal the number 
of times that the contour D in the te-planc encircles the origin. If this 
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contour encircles the orijun more than (nice that is, if ,V > 1 it is 
clear that the inverse of the function = f[z). namely, c = 0 (u’), is 
multivalued, and that its Riemann surface in the u-plane has such a 
structure that a path corresponding to C in the c-plane closes upon itself 
after A" complete circuits around the origin in the ic-plane. 

The contour C may be located anywhere in the c-jilane. 'Fhe conclusion 
to be drawn is that the number of zeros of Ac) enclosed by (’ leach being 
counted as often as its order requires) is gi\'en by the number of times 
that the corresponding closed contour I) encircles the origin in the :c-plane. 
If the closed contour D (which may, when plotted in the simple :c-plane, 
be any closed cunx with or without crossover |)oints ) does not eiu irde 
the origin in the ic’-plane, the given function Ac) cannot jxissess any zeros 
within the region enclosed by C. Hence if it is to be established wliether 
a given function /(c) possesses zeros within a stall'd ri'gion. the (juestion 
may be answered through mapping in the :c-j)lane the locus corres|x)nding 
to a boundary enclosing the stated region in the c-[)laiu‘. and noting 
whether this closed locus in the :c-plane does or doe> not encircle the 
origin. 

It is, of course, necessary to know that f(z) does not have jioles 
within the region in question. The method is evidently also applivable 
to the determination of the presence of j>oles withiii a given region if 
the function is known to have no zeros there. In either ca.se, obvioudy, 
neither zeros nor poles should lie upon the contour C ’. 

In applying the method to practical problems it is helpful to note the 
significance of several detailed characteristics whidi the contour J) may 
exhibit. The contour C is assumed to enclose a simply connected region 
in the .c-plane. If this region contains neither zeros nor }X)les, the cor- 
responding contour D in the xc-planc, as staled above, does not enclose 
the origin; this characteristic of the contour I), }io\ve\er, may not ahvavs ' 
be immediately evident. The inverse function c = 01 a ), except in trivial 
cases, is multivalued, and hence the region enclosed by tinr lontour 
may have overlapping portions which actually, of course, lie on sej)arate 
leaves of the associated Riemann surface, d'hus, if the regnon enclosed 
by the contour C contains saddle points (see Art. 14), the contour I) 
encloses the corresponding branch points, and hence makes se\ eral com- 
plete circuitations about portions of the region enclosed by it. One 
readily appreciates, therefore, that the contour 1) may be a rather 
tortuous path even though the contour C is a simple one. It is jxissible for 
the region enclosed by the contour D to surround the origin completely 
and yet not contain the origin. 

The closed contour C is ordinarily assumed to be traversed in the 
counterclockwise direction, and the region enclosed by it is taken to be 
that on the left of this contour. The corres|x)nding direction of traversal 
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for the contour D may always be established through considering any 
two neighl}oring points on C and determining the corresponding neighbor- 
ing points on D. According to the principles of conformal mapping, the 
region enclosed by D lies to the left of this contour, but this obser\^ation 
alone is frecjuentl} insuHicient to enable one to tell by inspection whether 
the enclosed region contains the origin, and if so, how many times the 
contour D encircles the origin. 

method for the correct evaluation of this situation is the following. 
One imagines a radius vector extending from the origin of the u^-plane 
to a variai)Ie i)oint ic on D corres|Kmding to the variable point z on C. 
.\s c traverses C’ and tc traverses D, this radius vector changes in length 
and in its angular j)osition. For a complete traversal of c around C, the 
net change in the angle of the radius vector must obviously equal an 
integer number of lir radians (a change is |>ositive if it corresponds to 
rotation in the counterclockwise direction). The value of this integer 
ecjuals the numl)er of times that the contour I) encircles the origin. 

If the function u* = f[z) is analytic within the region enclosed by C, 
this integer ( annot be negative, but it can become negative if the region 
also ('ontains i)oles. For example, supjx)se/(c) = 1 and let C be any 
contour em losing the origin in the >plane. Then it is clear that the integer 
in (juestion e(|uals —1. One may say that the contour D in this case 
encloses the {)oint at infinity in the :c-plane. If C is the unit circle, in 
this simple example P is likewise a unit circle. The enclosed region in 
the ::-plane is that within the unit circle; the enclosed region in the 2c»-plane 
is that lying outsidt' the unit circle. The requirement that the function 
fiz) be known to have no |)oIes (or other singularities) within the region 
enclosed by (’ is again recogni/.e<l as being necessary, since the presence 
of ‘a iK)le cancels the effect of a zero as far as the net value of the integer 
is concerned. 

21 . Thk principle of the m.\ximum modulus; Rouche's theorem 

AND Schwarz lemma 

To continue the considerations of the previous article, if the function 
1 C = /(c) is analytic on C and within the region enclosed by it, and if on 
this contour !/(c)| ^ M (smne real jx)sitivc quantity), it follows that 
the contour 1) and its enclosed region must lie within (or be tangent to) 
a circle of radius M concentric with the origin of the it»-plane. This fact 
is clear from the consideration that the region enclosed by D cannot 
extend beyond a boundary' fonned by those confluent segments of the 
contour I) which are farthest (but still at a finite distance) from the origin; 
otherwise the point at infinity would be contained within the region, 
thus contradicting the assumption that/(3) is analytic within C. Since 
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every point within the region enclosed by C yields a point in the ic-planc 
which lies within the region enclosed by /-), it follows that j/(-)| ^ d/ 
for all points within C. Moreo^'er, the equality sign in this relationship 
holds onlv if the region enclosed by D consists of a single j')oint, in which 
case it holds identically, for then/(::) must reduce to the constant M. 

This result is known as the principle of the maximum modulus. It may 
readily be demonstrated analytically with the help of ('auchy’s integral 
formula, Eq. 80. The closed contour 5 {ii[xm and within which /(c) is 
analytic) is identified with a circle of radius p about some finite jxiint z. 
Then 

f - 3 = pe^^ [289] 

/a) =/(s + pe'*) [290] 

and 

Z^=Jde [291] 

s ~ - 

Hence one has 

/(-} = / r.hz + pe>>) do [292] 

Ztt </o 

from which the value of the function at the center of the circular region 
is seen to equal the arithmetic mean of its values on the boundary. 

Since the mean of a set of complex values must be les> than or at most 
equal to the mean formed from the ma.gnitudes of the.se values, it follows 
that the magnitude of /(c) must surely be less than or at most e(|ual to 
the largest value which the magnitude of the function assumes on the 
circular boundary. That is, if M is the maximum value of */{i'jj <>n the 
boundary, ; /(c )j ^ M. 

This result may be generalized to the extent that the bouiulary need 
not be circular and the point c may be any internal point. For if i he result 
were not also true in this more general ca.se that is, if the maximum of 
the absolute value of the function difl not occur on the boundary but at 
an internal point — by applying the specialized result to an apj)ropriately 
chosen small circle about this internal point, one would clearly eiu ounter 
a contradiction. This line of thought alsfj shows at once that the ecjuality 
of \f(z)\ and M can hold only if it holds identically, that is, if /(c) ecjuals 
the constant M. 

If the function f{z) has no zeros (as well as no poles) u]K)n or within 
the closed contour, by applying the same reasoning to the reciprocal 
function l/f{z)y one recognizes that both the minimum and the maximum 
values of \f(z) \ on the boundary are minima and maxima for the enclosed 
region. 
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An alternative proof, which is collaterally interesting, begins with the 
result expressed by Kq. 287 of the previous article. The function f{z) is 
analytic u\Ym C and within the region enclosed by this contour. A second 
function fiiz), in addition to .satisfying the same analyticity conditions, 
fulfills the relation 


< \/(z)\ 


on the boundary C. On this boundary one, tlierefore, has 


A'tr) 

fU) 


< 1 


[293] 

[294] 


and hence the closed contour for the function 


= 1 -f 


/(C'l 


[295] 


in the ^c-planc, corresponding to the contour C in the >plane, clearly 
cannot enclo.se the {X)int xe = 0. Acconling to the discussion in Art. 20, 
therefore, one has 



[296] 


Using Eq. 295, one now finds 

_ LlA^ f ~~ (/ d- .(,0 • df _ g) df 

IV /(/TaO f ^ 7 


[297] 


Hence in view of Eq. 290, the result expressed by Eq. 287 gives 


_L £UI±J1. = ..L ^ ,Y 

iTrjJc / -h .i: 2TrjJvf 


[298] 


from which it may lie concluded that, under th conditions stated above, 
the function /t-) + ^^{z) has the same number of zeros within the region 
enclosed by C as does the function f[z). 

In terms of this result (known as Rom he s theorem), the principle of 
the maximum modulus is easily proved. Suppose the maximum of 
|/(c)l did not occur on the boumlary C but at some internal point Zq, 
Then on the boundary one would have 


1 ^ 1 

fTi^l ^ IA=)1 


[299] 


and hence one could conclude that the function [1 '/(^o^ - 1 ^(s)] has 
the same number of zeros within the region enclosed by C as does the 
function l//(s). The latter has no zeros within this region because /(s) is 
analytic there, and the function [l//(=o) - l//(s)] has at least one zero 
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within the region, namely the one for :: = Co- I'he supposition is, there- 
fore, untenable. 

A more specialized result is obtained if one considers the function /(c) 
to be analytic within a circle of radius R about the origin and to ecjual 
zero at the origin. It then possesses the Maclaiirin series 

/(c) = (iiZ -f- ~l~ [300] 

from which it is clear that the function 

[301] 

likewise is analytic within the circle of radius R. If on the circular bound- 
ary it is known that 

|/(=)I ^ 3/ [302] 

then it follows that there 

[303] 

and, according to the principle of the maxinuim modulus, that the 
magnitude of <p(c ) is I ss than (or at most ecjual toJ .1/ R for all |)oints 
within the circle. Hence one has the result that 

i/(=): ^1-3/ [304] 

for all points within the circle, and the equality tan liold only if it holds 
identically, in which cast- f(z) reduces to a complex constant with the 
magnitude M K, multiplied by c. 

This particular result is known as y>clncitrzs lemma. It has nunterous 
practical applications in problems involving conformal mapping * 

22. Some useful correlations with potential tiif:ory; 
Poisson’s integrals and Hilbert transforms 

In Art. 5 it is pointed out that the real and imaginary parts u and v 
of a function of a complex variable may be interpreteti |)hysically as 
being two-dimensional potential functions. Any two functions of the 
variables x and y which satisfy the Cauchy-Riemann Kqs. 12 and 13 are 
said to be conjugate potential functions. Since Kqs. 12 and 13 become 
interchanged when u is replaced by v and v by -u, it is clear that the 
identities of these two functions become interchanged when the algebraic 

•An application of this sort is discussed in Art. 27, 
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sign of one of them is reversed. In the present article the properties of 
these functions are investigated in more detail. 

The point of departure in these discussions is the Cauchy integral 
formula, Eq. 80. A given function j{z) is assumed to be regular within the 
region enclosed by the circle shown in Fig. 21 as well as on this boundary. 
If denotes any point on the circle and z is any internal point, 



Fig. 21. Change of variable relevant to the derivation of Poisson’s integrals. 


If, in this integral, c is replaced by some point z* e.xtemal to the circle, 
the integrand is regular for all j)oints enclosed by the contour C, and 
according to the residue theorem or Cauchy’s integral law, the value of 
the integral is zero, lliat is, 


0 = 


±,fML 

IrjJc^ - z* 


dc 


[306] 


The point z* is now so chosen that 



£l 


[307] 


in which the bar indicates the conjugate value.* Using this relation, one 
finds 


1 1 




[308] 


♦The |K)int z* thus determined is called the image of the point z respect to the circle. 
The two i)omts z and z* arc geometrically related as shown in Fig. 21. This item is discussed 
in Art. 24. 
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Addition or suhtraction of Eqs. d05 and d()6 is then seen to yield 

i- ^ ? 1 


^ ± AO )+ ± ^ J fa) ^ [dW] 

Ittj Jc Is — z 5 — - I s 


in which Kq. .^05 is also used to obtain the particular relation 




Since, according to the notation mveii in Fiir. 21. 


one has 


Hence 


i' = 

f Rc^_ R' - rRr' ' ^ 

— z AV ’ — rc"^ K‘ + — IrR cu> i/' — <pi 

? f _c>j 

f — s i - z -r r — IrR cos — <j>) 


J_ 1_ _ co s R-- r^ 

^ — z~^C — z A"-f H — 2rA cos — <pj — 2r co.s — 


Equation 312 is uscfl to obtain 


— =jdyp 
i 


Substituting these results into Eq. 309, one obtains the following 
integral representations: 


y TT Jo R^ -f - — IrR CO., 


2irJo + r 


/e-’ - 

- IrR cos {yp — 0) 


f{R,tf/)d4/ [ 318 ] 


It is useful to obser\e that the quantity rR sin — </>) represents the 
area of the parallelogram determined by the vectors f and s ami that 

+ r^ - 2rR cos |f _ z\- [319] 
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is the* sc|iiiirc‘ of the distance l)etwcen the |M)ints f and c. It may, therefore, 
he concluded that the formulas j^iven by K(|s. .^17 and .SI 8 are independent 
of th‘' location of the center of the circle ( \ although in the above deriva- 
tion the center is for convenience chosen to coincide with the origin of 
the c-j)lane. 

If the function f(z) is written more explicitly as 

/(s) = +yr(/, 0 ) [320] 

and K(|s. .SI 7 and vS18 arc separated into their real and imaginary parts, 
one obtains the following two pairs of relations respectively: 

, 1 r-^rR'>\r\ (\l/ — <3) ^ r 

= //(O) f / 7 - 7 — v{R.yp)dyp [321] 

TT •>'0 f “ 

- "( 0 ) - 

7r«/o ( — z - 

and 

1 nir _ ,/> 

i4[r,<p) = - I - ii{R.\p) (I\P 

IttJo s* — 3 ' 

, 1 r-' R- - r- „ , , , 

vir,<t>) = -- / vrciR.ijdi [324] 

in which the relation 319 is used for the sake of abbreviation. It is ob- 
ser\'ed that the formulas in eac h pair become interchanged when u is 
replaced by v and by —u. 

d he results expressed by Kqs. 321 ami 322 show that the real and 
imaginary parts, u and f, of a function of a complex variable are, except 
for an additive constant, explicitly related to each other. The real part 
determines the imaginary part, or vice versa. Hence, given either part, 
the corresiX)nding complex function may be ouml. The formulas 321 
and 322 are said to yield the conjugate ix)terlial function to any given 
jx:)tential function or to transform one such function into its conjugate 
mate. In this sense they are sometimes referred to as a pair of transforms. 

The integrals 323 and 324 likewise fonn a pair, but they do not ex- 
press one ix>tential function in terms of the other. Instead, they yield 
the real and imaginary parts of a complex function at any point within 
the given circle in terms of their respective values on the boundary. If a 
px)tential function is known to be regular for all points on a circular 
boundary and within the encloseil region, it is there uniquely determined 
in terms of its lx)undary values by means of the formula 323 or 324. 

Por r =-- 0, these integrals yield 

w(0) = — r dyp 

It «/o 


[322] 

[323] 


[ 325 ] 
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and 


5(0) = 


[326] 


These results state that the value of either potential function at the 
center of the circle is equal to the average of its values on the circular 
boundary. As a consequence it follows that the largest and smallest of 
the values which u and :• assume throughout a circular regularity region 
must occur on the boundary of that region, for if such an extremum 
occurred at some internal ix)int, the conditions stated by K(|s. o25 and 
326 could not be fulfilled for a small circle with its center at this jK)int and 
its boundary within the original one. 

These conclusions yield a useful theorem to the effet t that if a functiem 
of a complex variable is regular over a given regicui. the maximum and 
minimum values of its real and imaginary parts must for that region occ ur 
on the boundarx’.* The region evidently is not re.slricted to be circular in 
form since the above reasoning is equally applicable when the boundary 
of the regularity region is arbitrary. 

The formulas 321, 322, 323, and 324 are known as Poisson's inlcgrals. 
By a combination of Kqs. 322 and 323 a further result of j)ractical utility 
is obtained. The first step is to form 


= ~ 

27r 2) K- -r r- - 2rR cos 


u(R,i)di [ 32 ; 


From Eqs. 311 and 312 it is seen that 

t + z ^ T- ^ 2?— r- - jlrR sin - <t>) ^ 

f — c Re^^ — R^ + r’ — 2rR cos {yp — <>) 

so that Eq. 327 may be written 

M<t,) = ;i'(0) + ~ f ' [-329] 

ZTT 5 — Z 


By means of the series representation 


l±l 

f - 2 


1 + 2 






[330] 


*This result, although similar to that statcfl by the prinriple of the maximum morlulus, 
sh ould not be confused with the latter. 2\s pointed out in the previou- artit le, tlie mc^lulus 
attains its minimum as well as its maximum value on the bountlary of the region 
of analyticity only if the function has no zeros within this region. That is, if the furu tit)n h;ts 
zeros there, the modulus attains its maximum value, but not its minimum value, on the 
boundary. 
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and Kq. 325, this result may be rewritten in the form 

= m + i£'{} + Y 2 + -) “(^.^) [ 331 ] 

The series in Eq. 330 is uniformly convergent for I 

1 

legration in Ecj. 331 may be carried out term by term. When this is done, 
with 

= f d\p [332] 

TT t/o 

it is seen that the result given by Ef}. 329 may be expressed in the alterna- 
tive form 

/(r.o) = /(O) + L [333] 


< 1, so that thein- 



Tn;. 22. procc.^s \s hit h converts Poisson s integrals into Hilbert transforms. 


*By means of either Eq. 329 or Eq. 333, the complex function is deter- 
mined within the circular region in terms of tlv values of its real part on 
the boundary. 

Other forms for some of these results, more appropriate to the condi- 
tions encountered in electric circuit thex^ry, are obtained through assum- 
ing that the circle in the preceding derivations is an extremely large one 
lying in the right half of the c-plane, tangent to the y-axis at the origin. 
If this circle is imagined to become infinite in diameter, the entire right 
half plane will constitute the enclosed region and the imaginary axis 
(or y-axis) will become the boundary, which then closes upon itself by 
passing through the point at infinity. The manner in which the integrals 
317 and 318 arc to be interpreted in this limiting case is clarified some- 
what when the situation is pictured in Fig. 22. The center of the circle of 
Fig. 21 is imagined to lie on the real axis at a point infinitely far to the 
right. The ixiint J*, which in Fig. 21 is a point on the circular boundary, 
becomes the ixiint jrj on the imaginary axis, and the point s is character- 
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i/c(i in terms of tlie variables .v and v of the plane of Fig. 22 as the point 

•V y V- 

In order to evaluate the forms which the integrals Ml and dl<S assume 
when this limiting pnness is carried out, one should observe that the 
appropriate transition may be indicated symbolically as follows: 

r sin - <>) -> ^y - ry) [^M] 

R- - r — /e" - (A’ - .v)- 2A.V fdd5] 


and 

f + O’ - iV [dd6] 


whereas 

R (Jyj, - * —Jrj 

in which the minus sign appears because the {positive direction of traversal 
of the circular iKuindary in Fig. 21 is countercloikwi.se and in Fig. 22 
this corresponds to the negative direction for the y-a.\is. Finally it should 
be observed that the limits of integratio!\ corresponding to a counter- 
clockwise traversal of the circle are from r x to — -c , or from -- x to 
-f if the algebraic sign of the integral is reversed. 

When these substitutions are made and /.'c) is assumed to vanish at 
intinity,* the integrals oI7 and ,^I8 become in the limit 




uix,y) -h jvix,y} 


1 r* ty - r?)/(yr;) ^ 

jttJ -e .V" + (y — 7])“ 


[338] 


and 


/■V • \ xf{ jrj) 

Jiz) = inx,y) + jv{x,y) = f -~r~ 

TTt/ « X- + (y — ri) 


[339] 


The transformation of the Poisson integrals, which in their usual forni 
apply to a circularly bounded region, to the forms given by Kfjs. 338 and 
339, which apjdy to the right half plane, may be carried out in a some- 
what more sati.sfactory manner (than the heuristic one just given) by 
use of a linear fractional transformation (discussed in Art. 24) which 
effects the mapping of the interior of a circle about the origin ujK)n the 
right half plane. 

From Art. 24, one lincls that the transformation which reads 

/ 

z — a 

Z = ~7 

Z (I 



transforms the interior of the unit circle in the s-plane into the right half 

•The validity of the integral in Eq. 3.^8 is restricted to this condition. The term /(O) in 
Eq. 317 becomes /( oc ) and drops out. 
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of the s^-plane, with the origin of the 2 -plane corresponding to the point 
2 ' = a on the real axis of the 2 '-plane. As the unit circle in the 2 -plane is 
traversed in the counterclockwise direction, the imaginary axis in the 
2 '-plane is traversed throughout its entire extent from -+- 7 ^ to — 7 
(that is, in the negative direction). 

If 2 is some jx^int within the unit circle, z is a [X)int within the right 
half j)lane. 'I'o a point I on the unit circle in the 2 -plane there corresponds 
a jx)int on the imaginary axis of the z'-plane which likewise is deter- 
mined from the transformation v340, that is, 


r = 




[341] 


The desired transformation of the Poisson integrals is carried out 
through returning to I^i- .^09 and introducing there the change of variable 
expressed by Kcis. 340 and 341. One finds from a simple calculation that 


f 

' — (/ ) (2 -r (/ ) 

[342] 

f “ - 

- z) 


2,1 d<' 

[343] 



f (S-' 

— 4- a) 


The integrand appearing in the integral of Kq. 309, therefore, becomes 


I (f' - -f a) ii'' - iJ)\z' -f (/) | dj' 

i f' - s' V -I' I (f' - + <j) 


[344] 


It should now be observed that one wishes to have the origin of the 
2 -plane corres|X)nd to a jx)int in the 2 '-plane which is infinitely remote 
from the origin, because the value of the function at the origin in the 
2 -plane (the quantity /(()) in Eq. v^09) is then arried over into the value 
of the function at infinity in the 2 '-plane. Umler the assumption that the 
function vanishes at infinity (which must, of course, be met by any 
physical problem to which the resulting formulas are applied), the term 
involving /(O) in Eq. 309 then drops out. 

Although it is not possible to determine a linear fractional transforma- 
tion which maps the interior of a circle about the origin upon the right 
half plane in such a way as to make the origin or the center of the circle 
correspond to the point at infinity (this would require an infinitely large 
value for the quantity a in Eq. 340), the desired end may be achieved in 
the present problem by considering the limiting process indicated by 
a — > 00 to be applied to the expression 344. The result reads 


i' 


1 


~ 2 





[ 345 ] 
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Substituting this result for the integrand appearing in Eq. vS()9, dis- 
carding the term f{0) for reasons already stated, and noting that the 
algebraic sign of the integral is reversed if the integration is extended over 
the imaginary axis of the c'-plane in the {positive direction, one has after 
dropping the primes on the quantities z and s' 

f(z) = ^ ’ I [ fU) Jf [346] 

IwjJ-j* (C - s - — S J 


which is the desired result. It is readily brought into the more explicit 
form given by Kqs. and 3M) through writing = .v jy for any fixed 
point in the right half plane and s' = jv for the variable f>oint of integra- 
tion along the boundary or imaginary axis. Ihe two algebraic signs 
appearing in the integrand then yield respect ixely the results 


and 


. 1 r 

^ 77r J~x .V" -I- (y - 

/(=) - '/• 

ttJ-* .r“ + iy — t;)- 


which agree with Eqs. and 339. 

Separating real and imaginary parts yields the pairs 

. , 1 (,v - »)):■((),»)) ^ 

- - ' f ~ d, 

TrJ-.x-+(y-ri)- 


and 


, ^ IT" xu(0,v) j 

u(x,y) = - / , y.drj 

Tt/— » X‘‘ + (y — n)* 


+ (y - ir 
xi<(0,n) 


, ^ If- xi-(0,v) 

v(x,y) = - / 

x^ + {y - V)’ 


^ + (y - v)- 

XV(0,ri) 


[347] 

[348] 

[349] 

[350] 

[351] 

[352] 


The last two integrals determine the potential functions in the right 
half plane in terms of their respective values on the imaginary axis, which 
is regarded as the boundary of the right half plane. The individual rela- 
tions in the pair 349 and 350, on the other hand, may be used to determine 
either of the potential functions in the right half plane in terms of the 
boundary values of the conjugate function. A potential function along 
the boundary is given in terms of the conjugate function by either of the 
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integrals 349 or 350 for re = 0. One thus obtains the pair of relations 


„(y) = l r dv 

[353] 

1 

8 

1 

7r«/— » y — 7j 

[354] 


which are known as Hilbert transforms. 

Because of the singularity of the integrand at the point = y, the 
value of either of these integrals, in the ordinary sense, does not exist. 
It is, however, jK)ssible to overcome this ditliculty* by the definition of a 
particular process of evaluation yielding the so-called Cauchy principal 
laluc. I'his value is obtained through approaching the point v = y 
symmetrically from both sides, as indicated for the integral 353 in the 
following expression: 




limit 

«-»o 



‘ I'iy) dr) 
y - r) 



[355] 


In order to see that this limit has a definite value, one may represent 
the function ifr)) as 

vin) = v{y) + (i) - v) • p(y,n) [356] 


in which p^y,n) is regular in the vicinity of the point rj = y. Then 

«/— * y — r\ t/ - « y — T) » 

The principal value of the first of the two integrals on the right-hand side 
is obtained through the following steps: 


f -3±-^ J _ [in (y _ _[in(y-,) 

y — 7j «/!/+« y “ ’y L J— « L Jy+e 


/— 00 y — 7 ^ 

When this is written in the form 


[in (y - 7j)] ^ - [in Cy - ’/)]_^ 
the limiting process is found to yield 

IW, [in (f )] - limit [l„ . 0 


[358] 

[359] 

[36C] 


*Scc E. C. Titchmarsh, “ Conjugate Trigonometric Integral?,” Proceedings of the Lottdon 
Maihemutical Society, 2n(i series, 24 (1926) 109-130; “ On Conjugate Functions,” ibidem^ 
29 (1929) 49- 81. 
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Hence only the second inte^^ral on the right-hand side of K(|. v^57 remains. 

According to Eq. v^56, the integrand piy^rj) of this second integral is 
observed to decrease no faster than 1 ,. rj for large values of rj regardless of 
whether :’(r/) vanishes or remains finite for r; — > oc . Although it is true 
that the integral from zero to infinity of a function which decreases no 
faster than 1 rj for large rj does not remain finite, it is essential in the 
present instance to observ e that if the limits of the integral are from — x 
to X, and the integrand is an odd function about the |X)int 77 = x , one 
can appreciate in a general way how a finite \'alue can result by noting 
that the contributions to the integral on the ix:)sitive and negative sides 
of the point x have opixisite algebraic signs. This is the same sort of 
reasoning as that involved in deriving the finite principal value .>55. 

By reference to Eq. 356 it is seen that if v{yi) vanishes for rj x , then 
p{\\rj) is an odd function about the point tj = x; hut if v(rj) remains 
finite in this vicinity, piyw) is odd about 77 = x only if v(tj) is an even 
function; that is, if — 77 ) = : ( 77 ). Hence one is led to the conclusion that 
the given potential function t'( 77 ) either must vanish for 77 x or must 
be an even function of 77. Inasmuch as an odd function about x must 
necessarily be either zero or infinite at infinity, it is sufficient to state 
merely that the potential function t’( 77 ) shall remain regular in the 
vicinity of 77 = x .* Since all the above discussion applies equally to the 
evaluation of the integral 354, the same final comment applies also to 
the conjugate function in 77). 

It is thus seen that the transforms 353 and 354 may be written in the 
alternative fonn 



»Cy) = - ' f 

TT 


* U (ri) — U(y) 
» V - 


dr] 


[361] 

[362] 


Here the functions 21(77) ~ 2 '(y) ancl 2/(77) — u{y) vanish in the vicinity 
of the point 77 = y at least as strongly as the factor (y — 77), so that there 
can be no question as to the finiteness of the integrals so far as the 
presence of the factor (y — 77) in the denominator of the integrand is 
concerned. 

In the evaluation of the integrals 351 and 352 for the boundary x = 0 , 
a difficulty arises due to the fact that the integrand vanishes everywhere 
with the possible exception of the point 77 = y. Yet it must be true that, 
in the limit x 0, the integrals yield the values «(0,y) and 2 '( 0 ,y). That 
they do so, may be shown in the following way. 

*For a more thorough discu.ssion of the question of convergence of the Hilbert transforms 
the reader is referred to the article by K. C. Titchmarsh already cited. 



Ar/. 


POISSON'S INTEGRALS AND UILBERT TRANSFORMS 341 


Since the only |)ossil)le contribution to the value of the integral in the 
limit jt* — > 0 must come from the immediate vicinity of the point rj = y, 
it is clear that for example, the function wfO,?;), in Eq. 351, may be re- 
placed by its value «(0,y) at the point v = y, and since it then has 
nothing to do with the process of integration it may be placed in front of 
the integral sign. It then remains to show that 


u(0,y) = 


r ^ i 

TT [ J X -f (y — 


or that 
limit 

j -*<) 


1 Xd rj 

TT j * -V“ -f- (y — T? ) “ 


= limit 

X— »0 




[363] 


1 [364] 


which is evidently true. 

I'o return to the integrals 353 and 354 (or the ecjuivalent ones given by 
K(js. 361 and 362, it is useful to obser\'e that if 1(77) is an odd function 
of Y}, then H(r]) is an even function, and vice versa. To see this, one may 
write the integral 353, for example, as 


«Cy) = - / 

V(rj) drj ^ 1 r^v(rj) drj 

[365] 

TT t/ 

- M y — 77 TT t/O \’ — 7} 

or 



u(y) = - W 

''-•°v(v)dri ^ 1 dri 

[366] 

TT t/i 

> y — rj TT t/n \’ — rj 

If v{rj) is assumed to be odd, 

replacing the x ariable of integration 

v in the 

lirs*t of the above two integrals by — rj shows that 


1 r-* 

V(ri) dri 1 r V(ri) dit 

[367] 

ttJo 

y — 7} IT Jo y -|- Tf 

Hence Ecj. 366 becomes 



II (y) - f ( - , 

TT Jo \v — t) y + V/ 

[368] 

or 



uiy) = 

tJo 71“ — y“ 

[369] 


From this form it is immediately evident that w(y) is an even function 
of y. 

By means of an entirely parallel process, one may transform the 
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integral 354 with assumed to be rccn and obtain 



from which it is apparent that ?(v) is an odd function of v. 

It is to be obser\’ed that the transforms 369 and 370 are no longer 
general but apply only to cases in which u is even and :• odd, as is assumed 
in their derivation. This restricted character of the fomis 369 and 370 is 
recognized at once from the fact that these integrals no longer become 
interchanged when u is replaced by e and v by — //. 

One might just as well have derived special forms for the opjx)site 
assumptions, namely that u be odd and r even. However, for the rational 
functions and many others met in practical problems, the real j)art is 
even and the imaginary part is odd when regarded as a function of y for 
.V = 0, that is, for points on the imaginary axis. The following more 
detailed discussion pertains to this im|x>rtant special case. 

If the same method of transformation is applied to the ecjuivalent 
forms 361 and 362, the results obtained are 



The last of these is the same as 370 except that ii{t)) — ti{y) replaces 

It should be mentioned that the even function //(y) is determined by 
the integrals 369 or 371 except for an arbitrary additive constant. How- 
ever, the difference between any two values of w(y) for specified y-values 
must be uniquely determined. For example, using the integral 369, one 
finds 

m(=c) - m(0) = - /* [373] 

T«/0 y] 

which is a simple expression for the difference between the values of ti 
at the two extremes in the range of y-values. 

This last expression may be put into a practically more useful form 
through introducing the change of variable indicated by 

[ 374 ] 

\yj V 

•If a given potential function is constant, the conjugate function is zero. Therefore, the 
conjugate oiu{ri) -j- constant is the same as the conjugate of u{ri). 
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which amounts to introducing a logarithmic scale in place of the linear 
77- or >'-scaIe and choosing the arbitrary fx)int rj = yo sls the new origin. 
If v(0) represents the function v{rj) with respect to the logarithmic 
0-scale (that is, v{6) is v{r}) plotted on a logarithmic scale), the relation 
373 takes the form 

u('-r-) — i((0) = ^ f vis) dd [375] 

TT «/ - X 

or 

J viO) ^ 0=2 ^ ■“ [376] 


in which ui and u{0) still denote the values of u for y = ^ and y = 0 
respectively.* 

'riiis result states that the net area under the cur\'e for the function y, 
plotted on a logarithmic scale, depends only ujx>n the difference between 
the values of tlie corresponding //-function at the extremes of the y-scale. 
Although this result is only a partial statement of the implicit relation 
between the conjugate {X)tential functions it and i\ its simplicity makes it 
})articularly useful in connection with the practical problem discussed in 
the references just cited. 

A comparai)le result obtainable from the integral 370 reads 

2 F * 

limit [vT(y)l = — f ii{rj) drj [377] 


.\nother useful particular relation between it and v for points on the 
imaginary axis = jr], z = jy) may be obtained from the integral 372. 
The latter may be rewritten in the form 


n,v)=7'^ 

TT 1 


1 drj 


2\y v/ 


Introducing the change of variable 




[378] 


[379] 


and writing u{d) for the function u{rj) in terms of the new logarithmic 
variable 0 give 


TT */ — 


' u(d) — «(0) 
» sinh 6 


dd 


[380] 


♦Thi'^ particular form as well as the ones given by Eqs. 377, 380 and 393, were obtxiined by 
II. W. Bode. Sec his U.S. Patent 2,123,178; also “ Feedback .\mplifier Design,” Bell Systevt 
Technical Journal, 19 (July 1940), 421-454. 
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The integrand in this integral remains regular throughout the entire 
range — oo <6 < x . At the limits 6 = dti^ , ti{0) has the finite values 
of u{rj) lor T] = X and t/ = 0 respectively, whereas 2 sinh 0 — > dzc 
so that the integrand is seen to behave like c~ for large jx)sitive or 
negative values of 6. For the vicinity of the {x^int 6 = 0, 

and 

sinh 0 0 [382] 

so that the integrand there maintains a finite value. 

The integral 380 may be transformed into a more useful form through 
the method of integration by parts, which reads 

J' = p<i - f ‘i'^p 

Letting 

p = u{0) ~ u{0) [384] 

and 

il0 

dq = [385] 

^ sinh e '■ •' 

one has 

= [386] 

and 

0 B 

^ = In tanh - = — In coth - [387] 

Thus Eq. 380 becomes 

i'(3') = ^] + tX- (^) ^ 

[388] 

in which the absolute value of the variable B must be used because the 
integral 380 has a real value and hence the argument of the logarithm 
must remain positive. Since the quantity u{0) — w(0) appearing in the 
first term of Eq. 388 remains finite at the limits ^ = ± 00 , the value 
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of this term is zero at both limits. Its values throughout the range 
— 00 < ^ < 00 , moreover, are finite, since at the critical point ^ = 0 it 
has the value 



[389] 


This term, therefore, vanishes and there remains 

= ^390] 

1^1 

The factor In coth — appearing in the integrand has a logarithmic 

infinity at the point 6 = 0 and is a symmetrical function about this point, 
dropping off rather rapidly on both sides. Since 6 = 0 corresponds to 
rj = y, it is seen that the value of the function :’(y) at any point is largely 
determined by the slope of the conjugate function uItj) at that point 
when plotted on a logarithmic scale. 

This intcq:)retation of the result expressed by Eq. 390 may be clarified 
through writing 

\Wd=o \dr} d6)r,=fj \d6)o 

and noting that* 

f " In coth 9 (/« = 2 f * In coth = ^ 

J-«o 2 Jo 2 2 


[391] 

[392] 


Then, by adding and subtracting {du/d6)o to {du d6) in the integrand of 
Eq. 390, one obtains the fonn 



dii\ 1 r** \ 


In coth 



[393] 


Here the first term represents the major contribution to the function 
t’(y). If u{6) is a symmetrical function about a given point v — y (that is, 
$ = 0), then {du/d6) — (du^ d6)i) is an odd function about this point and 
(because In coth [|0| 2] is even about 0 = 0) the contribution of the 
integral in Eq. 393 becomes zero. Its value in any case is seen to depend 


•Making the change of variable .v = coth B. 2, one finds 


2 


i 


In coth - dB 
2 


J f^lnxdx 
I 1 -:r “ 2 


which is No. 3 of Table 187 in Bicrens de Haan, Nouvdks tables d'inUgralcs definUs (Engels, 
Leiden), 1867. 
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only upon the asymmetry of the function ii(d) about the jx)int ^ = 0, and 
because of the critical character of the factor In coth (;0|/2), only the 
asymmetry in the more immediate vicinity of this {X)int can have an 
appreciable etTect. 

Although the computational labor involved in the solution of many 
problems of this sort may be quite heavy, the necessary theoretical basis 
for such a solution is evidently afforded by the Hilbert transforms or by 
moditications of these. However, when certain as]>ects of electric circuit 
theory are dealt with, a somewhat more complicated version of the 
problem presents itself. Instead of either u{y) or n v) being specified over 
the entire range 0 < y < x , ;/(y) is sjx'cified only over portions of this 
range and i’(y) is specified over the remainder. For e.xample, uiy) may be 
specified for 0 < y < Vo and v{y) for y„ < y < x . d'he problem is to 
determine i({y) and fiy) for the ranges in which they are not specified,* 
that is, the object again is to determine the whole complex function over 
the entire range 0 < y < x . 

The clue to the method of solution lies in dividing the complex function 
u(y) + jv{y) by the irrational factor \ 1 — y- y,)-, which is real over the 
range 0 < y < yo and imaginary over the complementary range yo < 
y < X . Hence the resulting function reads 


and 



for 0 < y < yo 


for Vo < y < ^ 


[394j 


[395] 


The significant part about this result is that the real and imaginary 
parts of the resulting function alternately involve // and v in the two 
ranges. The same effect may, of course, be obtained if the complex func- 
tion u + jv is multiplied instead of divided by the irrational factor, but 
the resulting real and imaginary parts might then no longer remain finite 
in the vicinity of the point y = x . Also, one could presumably find other 
irrational functions which are alternately real and imaginary in the 
ranges 0 < y < yo and yo < y < but the one used above is perhaps 
the simplest. 

Substituting the real and imaginary parts of the functions 394 and 

*The solution to this variation of the problem wiis obtained by H. \V. Bode, loc, cit. 
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395 for u(y) and v{y) in the transforms 369 and 370, one finds respectively 



and 


for 0 < y < Vo 


[396] 

for yo < y < 


TT 



u{t)) di) 




TT 



vii]) di] 


ir 




o - 1 


v(y} 



for 0 < y < Vo 


[397] 

for yo < y < 


Equation 397 is used when «(y) is specified over the range 0 < y < yo 
and viy) over the range yo < y < ^ ; Eq. v596 applies when the reverse 
is true. As an illustrative example, one may suppose that //(y) is specified 
to be constant throughout the range 0 < y < yo md that v(y) is constant 
over the complementary range. Inasmuch as u{y) is, by the present 
methods, determined within an arbitrary constant, one may simplify 
these given data somewhat by assuming ii{y) to be zero over the range 
0 < y < yo and adding the desired constant value to the whole solution 
afterward. Thus one has for the given data 


u(y) =0 for 0 < y < yo 
v{y) = Vo for yo < 


[398] 


When Eq. 397 is applied, the first integral drops out because u = 0. 
The remaining expressions are somewhat simplified by the substitutions 


X 



^ = 


z 

yo 


[399] 
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One then lias 


. for 0 < jTo < 1 


— r [400] 

^ i.V‘ “ -VoM\ .V- - 1 -//(.Vo) . ^ 

I - — ; — -JZI for I < Xo < CO 
[ ^ *Vo“ — I 

The integration may be accomplished through noting first that 

r* (h 1 r* dx 

^ Ji ( v- — .vo“ )\ .V“ — 1 (.V — .Vo)\ .V‘ - 1 

1 /•* dx 

- - / — =F= 

2*^1 (.V + .Vi))\ .v‘ - 1 


Using the substitution 


one finds 


whereas with 


there results 


.rap - 1 

X - .Vo 


dz _ \ 1 — .v,,“ dx 

^'1 - (.V - .Vo)\ .V- - 1 


ara’o + 1 

X + a*o 


-iL= - [«5i 

V 1 — S- {x -|- .Vo)\ .V" — 1 

The changes in the limits of integration which accompany the substitu- 
tions 402 and 404 being noted, the integral on the left-hand side of Eq. 
401 is found to be given by 

1 r dz r^o dz "1 

2V\ - V v/1 - 2* Vl - zJ ’ 

The integrands in these integrals are even functions. Hence the integral 
from 1 to Xq may be replaced by one from — :ro to — 1. The two integrals 
in the expression 406 may then be combined, giving 

2^^(>vo ^ vq dz 

TT Jl (^2 _ X(i^)V — 1 wVl - Xi)^*^-^0 V 1 — 
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The integration is now no longer diflicult. Equation 400 is thus observed 
to yield lor 0 < Xq < I 

v(Xo) = — sin“‘ io = — In (\/l - + jxo) [408] 

TT T 


whereas for 1 < a*o < "-c , in which range it is appropriate to consider the 
right-hand side of Eq. 407 in the form 


7r\ 


;Vo- - 1 ^-0 


dz 


■xo — 1 


[409] 


the integration yields the real part 

u(xo) = In (\/.Vo“ - 1 + Xo ) 

TT 


[410] 


One readily recognizes from these results that the complete complex 
function is given over the entire range 0 < y < by the expression 

»(y) +>0') = Jn (Ji - +'^^) [-ill] 

The procedure may readily be extended to problems in which the range 
0 < y < is di\'i(led into more than two subranges. For example, if u 
and I' are alternately specified in the ranges 0 < y < yi, yi < y < yo, 
y-j < y < , the Hilbert transforms are applied to the real and imagi- 

nary parts of the function {n -f Jv) divided by \/(l — yVyi“)(l — 

The determination of the appropriate detailed relationships follows the 
same pattern as given above for the simpler problem invohing two sub- 
ranges. 


23 . More about potential theory and conjugate functions 

Additional useful physical inteij^retations relevant to properties of 
functions of a complex variable may be had from pursuing further the 
intimate relationships between certain of these properties and potential 
theory. The starting point for the present discussion is the simple physical 
situation shown in Fig. 23. The plane of the paper, which is also regarded 
as the complex c-plane, represents a plane perpendicular to an infinitely 
long straight conductor. The origin of co-ordinates for this plane is chosen 
coincident with the conductor, which is assumed to carr>^ a uniform linear 
distribution of charge of q coulombs per unit length and also a uniform 
current of i amperes whose reference direction points upward. 

According to elementary theory, the electric field intensity S and the 
magnetic flux density at the point Sq are given by the negative gradient 
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of corresponding potential functions'^ and that is, 

6 = — grad T) £B = — grad 7 [412] 

with 

"D = ^ In - [413] 

and 

7 = -Ifjiiip [414] 

in which £ and /u are the dielectric pcrmitti\ity 
and magnetic permeability resjxfctively of the 
homogeneous isotropic surrounding medium.* 
The field comp)onents in polar co-ordinates are 




[415] 

1 

II 

’ rdq, 

[416] 

By substitution from Eqs. 413 and 414 these become 


- 2q _ 

®r = — 

er 

6^ = 0 

[417] 

% = 0; 


[418] 


e-plane 




7^0 


"A 

7 \t 




conductor 


Fig. 23. The plane perpen- 
dicular to an infinitely long 
straight charged conductor 
carr\'ing current. 


It is now useful to observe that if one defines a single complex potential 
function 


^ 1 .-All 

= — -f J — = - In = In — 

iq Zfit Zq 

[419] 

in which 


Zq = 

[420] 

the field components are determined from 


dw e&r . ®r 
dr 2q ^ ^ 2/xi 

[421] 

and 


dw . £6^ 

r d<t> 2q ^ 2\ii 

[422] 


'Unrationalized MKS units are used in these discussions. 
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or the resultant fields from the relation 


, e6 . 
-grad-^- 


[423] 


The real and imaginary parts of the gradient of the complex potential 
function yield the electric and magnetic fields respectively (except for 
constant multipliers). 

It is thus seen that the real and imaginary parts u and v of the simple 
function 

IV = \n — = u jv [424] 

2o 


when multiplied respectively by the constant factors 2q/e and Ifjti are 
the scalar electric and magnetic potential func- 
tions for the single charge and current-carrying 
conductor of Fig. 23. This example, incidentally, 
aiTords an interesting physical interpretation of the 
multivalued character of the logarithm function. 

Since, for the scalar magnetic potential T, the 
radial lines are equi{X)tential loci, this function 
changes continuously in the same direction as 
one proceeds along a path encircling the con- 
ductor, each complete revolution yielding an 
increment proportional to 27 r. If T is likened to 
an altitude function in a physical terrain, the 
.similarity of the Riemann surface to a winding 
ramp is realized. ''J'he branch |X)int about which 
the surface winds becomes, in terms of this mag- 
netic field inteq)retation, a vortex thread which is 
physically realized by the current-carrying cc i- 
ductor. This singularity for the field and the function representing it is 
indeed the seat of its excitation. 

A pair of identical conductors are now considered. They are spaced 
symmetrically with respect to the origin as shown in Fig. 24 and are 
assumed to carry charges and currents of equal magnitude but opposite 
sign. The complex potential function in this case is, with reference to 
the notation indicated in the figure, 



Fig. 24. The plane 
perpendicular to a pair 
of identical infinitely 
long conductors carry- 
ing current in opposite 
directions and carrying 
opposite charges. 


w = In 


So - « 


-In 


1 


Zo + a 


[425] 


or 


\So — d/ 


[ 426 ] 
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It is now assumed that the spacing 2a between the conductors is very 
small compared with the distance to any }X)int Zo at which the held (or 
the resulting function) is to be studied. For the moment this spacing 
may be regarded as being finite, although for some of the subsequent 
discussions it is more appropriate to allow the limiting process a — ♦ 0 to 
be completed. At all events the condition 


a 





[427] 


is assumed to be fulfilled, so that Eq. 426 is replaceable by 

1 / 4 2(A 2a _ _ 

^ In ( 1 -f - ) ^ — [428] 


with any accuracy that may become necessary. 'I'liis form for may be 
obtained through expanding the logarithm in a Maclaurin series in terms 
of the variable 2a Co and retaining only the first term, or it may be derived 
in a somewhat more lucid fashion through first writing the right-hand 
side of Eq. 425 in terms of the defining integral for the logarithm, thus 


/ 2r, + ild^ 
'T 


which is equivalent to 


w 


JzQ—il f 



[429] 


[430] 


If the condition 427 is fulfilled, then the points a and Cq — a are so 
close together that this integral is very nearly given by 


•iv 


■-f 

ZqOzh- 


‘^o+a 2a 
di = -- 
2o 


[431] 


which agrees with Eq. 428. 

As stated above, the conductors in Fig. 24 also carry currents i having 
equal magnitudes and opposite directions (upward from the plane of the 
paper for the conductor carrying +q) so that the real and imaginary 
parts of 'iV again represent scalar electric and magnetic ix)tential functions 
when multiplied by their appropriate factors. These, incidentally, may 
just as well be assumed to be numerically equal, as expressed by 



(numerically) 


[432] 


Then the common factor may be included in the expression for w. Since 
this procedure facUitates matters somewhat, it is assumed for the follow- 
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ing discussion that 

Aqa 

w = 

62o 


[433] 


It is now possible to let the spacing 2a become as small as desired 
without having the magnitude of w become small at the same time, since 
one can demand that as 2a is made smaller and smaller, the charge q 
be made corresix)ndingly larger so that the product 

2aq = m [434] 

remains ( onstant. In the limit a —►0, ^ approaches infinity. This limiting 
configuration of oppositely charged conductors is a useful concept and 
is referred to as a dipole or doublet. The quantity w, called the moment of 
the (lijxile, is considered to be a vector coincident with the line joining 
the two charges (the axis of the dipole) and directed from the negative to 
the jxisitive charge. The vector character of this dipole moment is taken 
into account through its being considered a complex number although 
in the present example it is real, as is evident from inspection of Fig. 24. 

The complex ix)tential function for the dipole is 


2m 2m , , 
w = — = — 

6So £r 


[435] 



Fig. 25. A dipole of moment m 
Although in this derivation the dipole with arbitrary position and 
is assumed to have a very particular lo- orientation, 

cation and orientation in the s-plane, 

namely, that shown in Fig. 24, one may readily generalize the expression 
435 so that it becomes appropriate to a dipole in an arbitrary position 
and orientation as is shown in Fig. 25, where the dipole moment makes 
an angle ^ with the reference axis. The dipole is located at an arbitrary 
point 2, so that one now has 

{zq - 2 ) = [438] 


With m denoting the magnitude of the dipole moment, it is readily seen 
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that the appropriate modilkation of K(|. 435 reads 


... _ _ 2m 

€(Co - z) er 

im 

which has the real and imaginary parts 


2;w ^ . 

u = — cos (^ — (p) 

£r 

[440] 

and 


2m . 

V = - sin {\p — 0) 

£r 

[441] 


It is significant that if the dipole is rotated through 90 degrees in the 
clockwise direction — that is, if ^ is replace<l by ip — (w 2) then u 
becomes replaced by and v by — w. In other words, rotation of the (liiK)le 
through 90 degrees in the clockwise direction has the effect of replacing 
each potential function by its conjugate. Further discussion of this useful 
obser\'ation is given presently. 

The preceding derivations also show that the dijx^le may be regarded 
as the result of merging two logarithmic singularities of opposite {xilarity 
and infinite magnitude. As Eq. 439 shows, the end result is a simple ix)le, 

and the complex residue of the function u* in 
this pole, except for the factor 2 s is numer- 
ically equal to the vector moment of 
the dipole. One thus obtains an int(*resting 
physical inteq^retation for the kind of sin- 
gularity designated as a simple jxile, 
namely, that it may be visualizc^d as the 
seat of an electric or magnetic di{x)le with 
its moment proportional to the residue of 
the function in this {X)le. 

It is logical to continue with a similar 
study of poles of a higher order of multi- 
plicity. As the following discussion shows, 
these are represented by the merging of 
symmetrical configurations of larger num- 
bers of charges having equal (infinite) magnitudes but signs which are al- 
ternately positive and negative, there being an equal number of each. For 
example, the configuration following the dipole in its order of complexity 
is the so-called quadripole, consisting of two positive and two negative 
charges arranged alternately and spaced symmetrically on the circum- 
ference of a (vanishingly) small circle of radius a. Next in the order 
of complexity is the configuration involving six charges of which three are 



Fig. 26. A 2«-pole centered at 
the origin. The position of -f 
and — charges is correlated to 
the form of the resultant com- 
plex potential function. 
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positive and three negative, and so on. In general such a configuration is 
referred to as a 2«-ix)le, n being the number of positive or negative 
charges. The dipole is a 2»-pole for w = 1 ; the quadripole is a 2w-pole for 
n — 2, and so forth. 

Figure 26 shows a 2;/-pole for n = 6 with its center at the origin of the 
c-plane. The jx)ints Ci, 32, • • • Z 2 n are uniformly spaced on the circum- 
ference of the small circle, the even-numbered ones corresponding to 
jx)sitive charges and the odd-numbered ones to negative charges. The 
resultant complex potential function is given by 

In h In — ! h • • * + In 

3o — 32 3o — 34 2o “ ^2n 



- In — ! In — - 


•^0 ~ ^3 


- In ^ ^ 1 [442] 

So “ 32n-lJ 


or 


2^/ , (3(1 — 3i)(3o — 33 ) * • • (3o — 32n— 1 ) r a a 

IV = — In -- ^ r — [443] 

£ (3(, — 32 ) (3() — 34 ) • • • (3o — 32n) 


N'ow it is recognized that the values of 3 i, 33 , • • • 32 n-i are the roots of 
the ecjuation 

c” + a” = 0 

which are 

- = (/ v'^ — 1 = 


[444] 


{p= 1, 3, 5, •••2;^- 1) [445] 

and the values of 32 , 34 , • • • 32r, arc the roots of the equation 

3 " - a” = 0 [446] 

wdiich arc 

2 = a ^ (v = 2. 4, 6, • • • 2 m) [447] 

Consequently, 

(So - Sl)(So - S 3 ) • • • (Zo - Z 2 n-l) = Zq" + a" [448] 

and 

(zo - S 2 )(zo - S 4 ) • • • (So - Z 2 n) = Zo" - a" [449] 

SO that Eq. 443 becomes 



[ 451 ] 


Again if 
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the logarithm in this expression is very nearly given by 



[452] 


If the moment of the 2;/-pole is defined as 

m = 2qa^ [453] 


its complex potential function becomes 



O 


eco” 


2m 


e 




[454] 


which yields the result expressed by Eq. 
435 for — 1 . 

When the 2;/-pole is located at an arl)i- 
trar\* point z and has an arbitrary orienta- 
tion, as shown in Fijr. 27, its complex 
potential at the point ::o given by 


w 


2m 


S(20 - 2)" 


[455] 


Fig, 27. A 2«-pole centered at 2 . because Zq in Eq. 452 becomes replaced 

by (-0 “ z)c~^'^. Here the angle ^ is evi- 
dently determined only within an additive positive or negative integer 
multiple of the angle 2Tr/n. 


If again one writes 

when 


so that 


and 


(so - 2) = 

■U)=~ 

Sr" 


2m ,, . 

« = — cos n(tf; - 0) 


V = 


2m 

Sr” 


sin m(^ 




[456] 

[457] 


[458] 


[459] 


According to Eq. 455, the 2M-pole evidently represents a pole of 
multiplicity n for the function w. 
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It is now of interest to consifler a configuration of charges resulting 
when many identical difx)les are placed side by side, with a uniform 
infinitesimal spacing, their centers lying on any curv’e and their moments 
all ix)inting toward the same side of this curve. Such an arrangement is 
shown in Fig. 28. If the dimension peqxmdicular to the plane of the paper, 
in which longitudinal uniformity obtains, is mentally supplied to this 
jncture, it is readily recognized that this arrangement may be regarded as 


^2 



Fig. 28. The cross section of a sheet having a uniform surface distribution of dipoles. 

a uniform surface distribution of diix)les, and that the result is equivalent 
to having a surface or membrane uniformly covered with a charge of 
equal magnitude but opposite sign on its two sides, the thickness of the 
membrane corresponding to the spacing la between the charges in the 
di|X)les. 

This configuration is called a uniform donhh stratum. If the magnitude 
of the surface density of charge is denoted by a*, and la is the thick- 
ness of the membrane, or the separation of the surface charges of 
opposite polarity, 

2a<r = 7 [460] 

is defined as the moment of the double stratum. In terms of the individual 
dipoles with their charges Azq, one recognizes that if the uniform spacing 
between the dipoles is denoted by the differential dSy 

q — ads 


[461] 
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and hence 


lag = m = T ds 


The problem is to determine the scalar electric potential u at the ix>int 
Co- This may be determined through integrating the expression 440 over 
the surface of the double stratum. Thus, the differential contribution to 
the net potential at Zq due to the single dijxde at the ]K)int c is, with the 
help of Eq. 462, 


du = — cos (\l/ — <t>) = 
£r 


2t ds cos (\f/ — 0 ) 


But from Fig. 28 it is recognized that 

— rd<t> = ds cos — 0 ) 

so that Eq. 463 becomes 

du = 


Hence one obtains for the net potential at the point Zq 



This remarkably simple result states that the potential is proportional 
only to the net angle subtended at the point Zo by the boundaries of the 
double stratum. The shape of the double stratum between these bounda- 
ries is immaterial. It is essential to the achievement of this simple result 
that the moment t of the double stratum be constant over its entire 
surface. The expression given by Eq. 466 applies only to the }X)tential of a 
uniform double stratum. 

It is now a simple matter to find the potential function which is the 
conjugate of this potential u of the double stratum. It is pointed out 
above in connection with the single dipole that the conjugate to the 
electric potential function u results when that dipole is rotated through 90 
degrees in the clockwise direction. According to the principle of super- 
position, the conjugate to the double-stratum potential results when all 
the dipoles constituting it are rotated through 90 degrees. This process 
converts the double stratum into a dipole chain. If one chooses to make 
the spacing ds between the dipoles in the double stratum equal to the 
spacing 2a between the charges in the individual di[X)les, that is, if 

ds = 2a [467] 

the adjacent positive and negative charges in the dipole chain fall on 
top of one another and hence cancel, so that there is left only one nega- 
tive charge — ^ at the point Z 2 and one positive charge q at the point z\. 
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The choice indicated by Eq. 467 is, according to Eq. 462, equivalent to 
making 


q = r 


[468] 


so that the potential function conjugate to u given by Eq. 466 becomes 


V = 


2r To 

— In — 
£ ri 


[469] 


Since replacing u by v and by —u again yields a pair of conjugate 
|)otential functions, one may also write 

w = — In — [470] 

c To 

and 

2 T 

“ (<f>i ”* 02) [471] 


These are evidently the real and imaginary parts of the complex function 

IV = ^ In ^73-7^ [472] 


in which Co of Fig. 28 is now considered to be any variable point s. 

'Fhe response functions of linear passive electrical networks (driving 
[)oint and transfer impedances or admittances) are rational functions 
of the form 

r(\ _ ~ ^l)(^ ^3) • • • 1) 

^ ^ {Z - Z^:t)(z ~ C4) • • • (S - Z2m) 



I'll problems of this sort it is expedient to WTite 

f{z) = c® = 

and have, for the so-called loss and angle functions A and 5, 

(- ■” -l)(- ~ ^3) • • • (S — S2m-l) 


= A + jB = In 


{Z - Z2){Z - S4) • • • (- - S2m) 


[474] 

[475] 


Thus the loss function A may be regarded as the electric potential result- 
ing when a system of identical charged conductors are placed at the 
|X)ints ^2, C4, • • • ::2m and a set of conductors with charges of opposite 
sign are placed at the txiints -1, S3, • • • S2m_i. The function A is usually 
studied for pure imaginary values of c, that is, for points on the imaginary 
axis in the c-plane (these correspond to real frequencies in steady state 
circuit analysis). B is then the conjugate potential function with respiect 
to id . A problem in network design is thus reduced to the determination 
of a distribution of the points Si, 22, S3, • * • which yields suitable functions 
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A and B. The visualization of the problem lent by the analogy to jx)- 
tential theory can in many cases be helpful. 

If the rational function, Eq. 473, is expanded in partial fractions (as 
shown in Art. 16), each term is recognized as having the form of Eq. 439. 
The real and imaginary parts of this function may, therefore, be in- 
terpreted as the potentials due to dipoles at the points co, C 4 , • • • Som- 
More specifically, the real part (which may be a resistance or con- 
ductance) is the ix)tential of the diix)le distribution represented by the 
partial fraction expansion of /(c), whereas the imaginary part (a corre- 
sponding reactance or susceptance) is the conjugate potential resulting 
after all the dipoles are rotated through 90 degrees in the clockwise 
direction. 


24. Some useful fltnxtioxs in conformal m.\pping; the linear 

FRACTIONAL FUNCTION 

The artifice of introducing a change of independent variable by 
substituting for the given one some function of a new variable usually 
implies the transformation of one region of the complex plane into 
another. A contour in the one region is thereby mapped upon the other. 
The two regions may be mutually exclusive, or partly or wholly over- 
lapping, depending upon the nature of the transforming function. A very 
common substitution is 




which is more explicitly written 
f = f 


1 


^ - Jy 

X- -h y- 


whence 






In polar co-ordinates 


z = 

f = pe'* 


so that the transformation 476 yields 

1 


[476] 

[477] 

[478] 

[479] 

[480] 


[ 481 ] 
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and 

^ = -0 [482] 

Because of the relation 481, this substitution is spoken of as a transforma- 
tion in terms of reciprocal radii. Since p is larger than, equal to, or smaller 
than unity as r is respectively smaller than, equal to, or larger than unity, 
it is seen that the region of the ::-plane within the unit circle is trans- 
formed into the region outside this circle, and vice versa, whereas the unit 
circle itself remains intact except for a shifting of specific points according 
to the relation 482. In particular, the points dzl on the real axis remain 
fixed by the transformation, and these are consequently spoken of as the 
fixed points. 

A significant property of the transformation 476 is that the origin 
and the point at infinity become interchanged, that is, s = 0 corresponds 
to f = 00 , and 2 = X corresponds to f = 0. Thus a given function f{z) 
may be studied in the vicinity of 2 = ^ through introducing the change 
of variable 476 and then studying the resulting function F(f) = /(1/s) in 
the vicinity of f = 0. 

A variation of this transformation reads 



** 

[483] 

in which the bar indicates the conjugate value. With 



f* = 

[484] 

this yields 


[485] 

and 


[486] 

All the points on the unit circle are now fixed points. Corresponding points 
z and f* may be found by means of the geometrical construction indicated 
in Fig. 29, as may be seen from the fact that the similar right-angle 
triangles yield the relation 


\z\ OA 

OA ~ lf*| 

[487] 


in which OA is the radius of the unit circle and hence equal to unity. The 
point f* is said to be the image of the point z with respect to the unit circle. 
The transformation indicated by Eq. 483 is, therefore, also spoken of as a 
reflection or inversion with respect to the unit circle. As the point z traces 
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some figure inside the unit circle, the corresjxmding point f* maps the 
image of this figure outside the unit circle. 

If the unit circle is replaced by a circle of radius R, the transformation 
reads The geometrical construction shown in Fig. 29 applies 


to any value of R. If R 


1 




Hi 


H 



Fig. 29. The point z and its 
image = 1/5 as obtained by 
graphical construction. 

The transformations s* = 


is held constant as R is allowed to become 
larger and larger, and attention is focused 
u{X)n that part of Fig. 29 in the \icinity of 
the points c and alone, the circle will 
appear to degenerate into a straight line 
and the two points in question will ulti- 
mately be symmetrically located with re- 
spect to this line in the sense of an object 
and its mirror image. 

Other variations of this transformation 
which are sometimes used are f 

and = — f*. All these substitutions 
have the property of transforming circles 
into circles, as is shown from a more general 
point of view in the subsequent discussion. 
:±i\/z are, of course, conformal, and hence, 
as shown in Art. 2, angular relationships are preser\^ed in sense as well as 
in magnitude. The transformations f* = zt\/z are isogonal but not 
conformal. They preserve angular relationships except for a reversal of 
the positive sense of angular measurement, as in the right- and left-hand 
relationship between an object and its mirror image. For any closed con- 
tour not encircling the origin,* the direction of traversal is preserved by 
the transformations f = ±l/s, whereas it is reversed in the transforma- 
tions f* = it 1/s. These relationships are indicated for a set of circles in 
Fig. 30, in which a typical point P on the given locus z is also indicated in 
each of the transformed loci. 

It is collaterally useful to observe that, for the transformations f' = 
— 1/s and f* = 1/s, a given locus lying wholly in the upper (respectively 
lower) half plane, remains wholly within that half plane. In these cases, 
the upper (respectively low^er) half plane is said to be transformed into 
itself. Similarly, the substitution f = 1/s transforms the right (respec- 
tively left) half plane into itself, whereas f*' = —1/s transforms the 
right into the left half plane, and vice versa. In a specific problem the 
particular form of the substitution chosen is thus seen to depend upon the 
detailed requirements set by the nature of that problem and the purpose 
for which the substitution or transformation is made. 

•The conclusions in this statement are reversed if the contour does enclose the origin. 
This fact may readily be appreciated from the study of a few simple examples. 
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The substitutions f = ±1/2 are special cases of the more general 
linear fractional or bilinear form 


az A- h 
cz A- d 


[488] 



Fig. 30. Corresponding circular loci of z and various related reciprocals. Note the 
various senses of angular relationships. 


in which the complex constants a, h, c, d are subject to the condition that 
the so-called determinant ad — be oi the transformation shall not vanish, 
that is, 

ad -be 9^0 [489] 


and, in addition, 

e 7 ^ 0 


[490] 


The function 488 then is regular everywhere except at the one point 


d 

c 


[491] 


at which it has a simple pole. 

Inclusive of this point, there is a unique correspondence between all 
points in the 2 - and f-planes, so that the inverse of the function 488 which 
reads 


d^-b 
— cf + a 


[492] 


has identical mapping properties. 
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The condition 490 is necessary in order to prevent 488 from degenerat- 
ing into an ordinary linear form.* The reason for the condition 489 is 
appreciated through consideration of two points f i and f 2 in the f-plane 
and their corresponding fK)ints Zi and zo in the ;:-plane. Equation 488 
then yields 


(ad - bc){zi - Z 2 ) 
(cci “h d){cz2 “h d) 


[493] 


whence it is clear that a unique correspondence of points does not result 
if ad — be = 0. 

In addition to '^i and fo, two more points fs and s '4 together with their 
corresponding points S 3 and S 4 are now considered. By analogy to the 
form of Eq. 493, it is then readily seen that 

T3 ~ Tl _ ^.3 ~ ^ CZ y + d 

?3 f 2 -.3 ~ -2 ^*-1 + d 

and 

^4 ~ _ ^4 ~~ ^ rs 2 -V d 

f4 1 2 "4 ■“ 22 CZi -f* d 

and hence that 

T3 ~ / ^4 ~~ _ 2.3 ~~ / ^4 •" 

iz ~~ I ^4 ■“ ^2 23 — S 2 / 24 — Z 2 

With reference to Fig. 31 it is clear that 

^3 — 2i ^ ka — 2i I 

23 22 [23 — Z2I 

and 

24 ^ 2i ^ 1^4 Z\ \ j/3 

24 ”•22 I 24 — Z 2 I 

Now if the points Zi, Z2, S3, Z4 are assumed to lie on a circle, it follows 
that a = /?. The right-hand side of Eq. 496 is then seen to reduce to a 
positive real constant. Since, according to a familiar geometrical proposi- 
tion, the angles a and ^ can be equal only if the four points in question 
lie on a circle, and since, according to Eq. 496, a similar geometrical 
relationship to that showm in Fig. 31 likewise applies to the quantities 

•The mapping properties of the linear transformation w — az h are recognized as 
amounting to a magnification and rotation according to the magnitude and angle of the 
complex constant a, and a translation equal to the value of b. Any picture or map subjected 
to this transformation obviously remains undistorted. The mapping properties of the linear 
transformation are, therefore, said to be homographic. 


[494] 

[495] 

[496] 

[497] 

[498] 
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f 1 , f 2 , fa, f 4 , one may conclude that the corresponding points Ti, f 2 , fa, f 4 
must also lie on a circle. An important property of the transformation 
488 or its inverse 492 is thus proved, namely, that this linear fractional 
form carries circles over into circles. In other words, a circular locus in the 
c-plane corresponds to a circular locus in the f-plane. Straight lines are 
included here since they represent circles with inlinite radii, but it is not 
implied, of course, that straight lines are 
necessarily carried over into straight lines. 

'I'he linear fractional transformation is, in- 
( identally, the most general analytic trans- 
formation yielding a one-to-one correspond- 
ence between all points in the simple w- and 
c-planes. The term “ simple ” U'- or ::-plane is 
intended to denote such a one in which the 
uniciue distinction between points does not 
require the concept of a multileaved Rie- 
mann surface. With the interpretative aid of 
such a surface, any analytic function yields 
a one-to-one corresix)n(lence between |X)ints 
in the xi’- and :;-planes. In the simple comple.x 
plane however schlichte Ebene in the 
German literature), a one-to-one correspond- 
ence between points is afforded only by the so-called schlicht functions, 
of which the linear fractional function is the most general form. 

'I'he fixed points of the transformation are those for which f = 5 , 
or, Ecj. 488 being used, they correspond to the s-values satisfying the 
equation 



Fig. 31. Construction used 
in showing that the linear 
fractional transformation 
carries circles over into 
^circles. 


az h 
cz + d 


[499] 


which is equivalent to 

cz^ {d — a)z — 6 = 0 


[500] 


The roots of this quadratic equation may be denoted by Zi and S 2 - Then 
= Zi and fo = S 2 [501] 

If the f-plane is superimposed upon the s-plane, any circle passing 
through the points Zi and 22 is transformed into another circle passing 
through these same points. Hence the family of circles passing through 
the fixed points Si and Zo is transformed into itself in the sense that any 
one of these circles is transformed into another belonging to the same 
family. Now these circles possess an orthogonal family of loci which are 
also circles. These enclose the points Zi and S 2 , as shown in Fig. 32. Since 
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the transformation is a conformal one, it follows that the transformation 
of an orthogonal circle must also be orthogonal to any of the circles 
through Cl and Co. Hence the family of orthogonal circles is transformed 
into itself also. 



Fig. 32. Circles through fixed points Zi and c-. are transformed into other circles 
through the siime two fixed points. 


For proper choices of the constants in Eq. 488, this transformation 
may be made to carr>^ any specified circle in the c-plane over into an 
independently specified circle in the s*-plane. In order to see this one may 
write by analogy to Eq. 496 



in which Ci, C 2 , S 3 are distinct and s is a variable |X)int on a given circle 
in the c-plane, whereas fi, {' 2 , fa, and f are the corresponding distinct 
and variable points on the resulting circle in the f-plane. If one writes 
for the sake of abbreviation 


H = 


fa - fi / Sa - Zi 
fa “■ fa/ S 3 — C 2 


[503] 


Eq. 502, which is equivalent to the transformation 488, takes the form 


'f - f 2 s - C 2 


[ 504 ] 
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Here Si, S 2 , S 3 are any distinct points determining a specific circle in the 
s-plane, and fi, f 2 , fa are any chosen corresponding points in the f -plane 
determining a circle which the desired transformation is intended to 
yield. 

In particular, the points Zi and Z 2 in Eq. 503 may be selected as the 
fixed points of the transformation. Then si = Zi and ^2 = S 2 , so that 


H == ~~ 

fa ~ S2/ S3 — Z) 


Equation 504 then reads 



f — Zo Z — Zo 


[505] 

[506] 


This is called the normal form of the transformation 488 or its inverse 
492. 

Two special types of this transformation may now be defined. For the 
first of them, any member of the family of circles passing through the 
fixed ix)ints Zi and ::2 is transformed into itself. This statement means 
that the [x)ints C 3 and fa lie on the same circle, although they are, of 
course, not coincident like the points fi = ;:i and {*2 = S 2 ; otherwise the 
transformation would degenerate into the trivial identity f = s. The 
points Si, S2, -3, fa i^re then four distinct points on the same circle, like 
the four points ::i, c.*, - 3 , -4 in Fig. 31. According to the reasoning which 
shows the right-hand side of Eq. 496 to be a positive real constant, it 
follows from the similarity between the right-hand sides of Eqs. 496 and 
505 that H is a positive real non vanishing constant. This then is the condi- 
tion for which the transformation 506 (or its equivalent 488) carries any 
circle through the tlxed points Zi and zo over into itself. This t\"pe of the 
transformation is designated as hyperbolic. 

The second special t\pe has the property th..t any one of the orthogonal 
family of circles enclosing the fixed px)ints is transformed into itself. In 
order to recognize the condition which yields this result, one must recall 
that any circle enclosing the fixed points is, according to well-known 
principles in analytic geometry, defined as the locus of a point for which 
the ratio of its distances from the two fixed points (or poles) remains 
constant. If z denotes a variable point on such a circle, 


= constant 
S2I 


[507] 


The condition that the transformed |x>int f shall lie on the same circle 
evidently reads 


1 


-r_ii 

f - S 2 


Z — So 


[ 508 ] 
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which, with regard to Eq. 506, means that 

|H| = 1 (H ^ 1) [509] 

The parenthetic statement H 1 is intended again to rule out the 
trivial case f This special type of the transformation is designated 
as elliptic. 

With regard to the further general properties of the linear fractional 
transformation given by Eqs. 488 or 506, it is useful to recognize that 
this transformation not only maps the i^nnts on a given circle in the 
c-plane upon a corresponding circle in the {“-plane, but also furnishes a 
one-to one corresix)ndence between all points within those regions of the 
G- and {"-planes bounded by these circles. In other words, the transforma- 
tion is said to map one of these regions upon the other. 

Now a circle may be considered to be the lx)undary cither of the region 
enclosed by it, or of the complementary region which lies outside it. In 
order to remove this ambiguity, and also to furnish a clearer visualization 
of how the mapping of corresponding regions is effected by the trans- 
formation, one may consider a specific circle K in the c-plane and the 
corresponding circle C in the {"-plane. The points Zi, So, c.i lie on the circle 
K, and the respectively numbered corresponding points fi, {* 2 , f.j lie on 
the circle C. In the order in which they are numbered, the points in each 
set form a sequence which fixes a definite direction of traversal (clockwise 
or counterclockwise) around their respective circle. In this way, corre- 
sponding reference directions of traversal are fixed for the two circles K 
and C. It is clear from the preceding discussion that a variable point 
following along the circle K in its reference direction is transformed into 
a point which follows along the circle C in the corresponding reference 
direction. 

One may now imagine that at some point z the variable point following 
along K suddenly leaves this circular boundary by making a right-angle 
turn to the left. According to the principle of conformality, the corre- 
sponding variable point on C must likewise make a right-angle turn to the 
left at the point f corresponding to 2 . If one imagines a second circle A' 
in the 2 -plane lying wholly within the region to the left of K, with a diam- 
eter which differs from that of by a small amount, this line of reasoning 
shows that the corresponding circle C' in the f-plane lies wholly within 
the region to the left of C. Continuing this process of reasoning (by 
applying the same line of thought to the circles X', C' and a pair A", C" 
lying wholly within the regions to the left of K' and C', etc.) one finds it 
clear that the entire region to the left (or to the right) of K is mapped 
upon the region to the left (or to the right) of the circle C. 

If the reference directions of traversal around the circles A' and C are 
the same (both clockwise or both counterclockwise), the region inside 
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(respectively outskie) K is mapped upon the region inside (respectively 
outside) C, whereas if the reference directions are opposite, the region 
witlan K is mapped upon the region outside C, and vice versa. 

As an illustrative example, let it be required to find the function 
which will map the region inside the unit circle about the origin upon the 
region outside the unit circle. Since the unit circle forms the common 
boundary for the two regions in question, one must look for that linear 
fractional transformation which transforms the unit circle into itself. 
If the reference direction of traversal for the unit circle in the z-plane is 
assumed to be counterclockwise, whereas for the unit circle in the f-plane 
it is assumed to be clockwise, the regions to the left of these boundaries 
in the z- and f-planes are those which are to be mapped upon each other. 
With these points in mind, the normal form for the required transforma- 
tion is readily established. 

In Eqs. 505 and 506 one may choose the following correspondence of 
points: 

C, = +1 3-.. = -1 23 = -j 

fi = +1 f;’ = ~i fs = +y 


This makes s = ±1 corresixmd to the tixed points. Substituting into 
Kq. .s05, one finds 


li = 


y - 1 / - / - 1 




y + 1 / -j+\ 


= -1 


[511] 


so that the normal form of the required transformation, according to 


Eq. 506, becomes 




- 1 2-1 

f + 1 3+1 

[512] 

This is equivalent to 

1 

z 



[513] 


which is the simple transformation discussed earlier in this article, and 
evidently yields the desired mapping relationship. By choosing different 
sets of corres{X)nding points which nevertheless satisfy the condition of 
opposite reference directions around the unit circle, one may find in- 
numerable additional transformations which also map the inside of the 
unit circle upon the outside. It is, of course, not necessary that two of the 
chosen points be the fixed points of the transformation, since Eqs. 503 
and 504 apply to any arbitrary sets of corresponding points. 

As a second example, let it be required to find the substitution which 
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maps the inside of the unit circle in the c-plane upon the upp)er half of the 
f-plane. According to the principles discussed above, the following 
corresponding sets of points may be chosen: 

Si = +1 So = “1 Za = -y 

fi = +1 {*2 = —I i*3 = 0 [51^^] 


The unit circle in the s-plane is traversed in the counterclockwise direc- 
tion from + 1 around to — 1 and through — / back to 4- 1 ; the real axis in 
the f-plane is traversed from + 1 through the {’Kiint x to — 1 and thence 
‘through the origin back to -f 1- The regions to the left of these boundaries 
are those which are to be mapped u|X)n each other. Substituting into 
Eq. 505, one finds 





1 

7+1 



[515] 


so that the normal form of the dcsirecl transformation becomes, according 
to Eq. 506, 


r- 1 

f + 1 



[516] 


This is equivalent to 
f = 


s 4- 

— 


or 


s =7 


^Lzl 
f +y 


[517] 


Here the points s = ±1 again arc the fixed points. 

Alternatively one may solve the present problem by assuming the 
corresponding sets of points 

Zi = 4-1 22 = -\j 23 = -1 

fi = — 1 f2 = 0 fa = 4-1 [518] 


None of these are fixed points. Consequently the forms 503 and 504 must 
be used. According to Eq. 503, 



-2 

-1 -y 


1+/ 


[519] 


Substituting this into Eq. 504, one finds 


f = -7 


:^-l 


z +j 


or 


z ] 


.•f-y 


f +7 


[520] 


Comparison with Eq. 517 shows that these results are identical except 
that z is replaced by — z. 

From the nature of this problem it is obvious that Eqs. 517 or 520 still 
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constitute solutions if z is replaced by ze^"^, where 0 is any chosen angle. 
For example, replacing z in Eq. 517 by —jz yields another possible 
solution, namely, 




1 - s 


or 


1 -h s 

This has the corresponding points 

s = +1, +jy 

f = 0, -fl, 


J + f 

[521] 


[522] 

00 , -1 


Again none of these are fixed points. The point s = — 1 here corresponds 
to the point x in the f-plane. In the transformation 517 the latter 
corrcsix)nds to the jx)int z = j; in Eq. 520 the point z = corresponds 
to f = X . Evidently the present problem may be solved with any 
desired point on the unit circle in the c-plane corresponding to the point 
X in the f-plane. 

Alternatively, one may for example map the right half of the f-plane 
u|xm the inside of the unit circle in the c-plane. The transformation 
which yields this result is obtained from any of the solutions to the pre- 
ceding example by replacing f by^s*. This change amounts to making the 
substitution f' = —yi' and subsequently writing f for again (a substi- 
tution which rotates all points by 90 degrees in the negative direction). 
Applied to the Eqs. 521, for example, this process yields 


f = 



or 


Lzl 

1 + f 


[523] 


for which a set of corresponding points are 

s = -fl, -fy, —1, — y 

0 , -y, 00, +y 


[524] 


It is observed that these are the same points as those in the set 522 except 
that the fqx)ints arc rotated 90 degrees in the negative direction, so that 
the imaginary instead of the real axis becomes the boundary of the 
mapped region in the f-plane. 

^ It is instructive to study the conformal maps of one of these functions 
in somewhat greater detail. The transformation 523 is particularly in- 
teresting in this respect, since it is perfectly symmetrical in the variables 
z and f. Figure 33 shows how a system of concentric circles about the 
origin in the s-plane, along with the orthogonal family of radial lines, 
are mapped in the f-plane. The origin in the z-plane becomes the point 
+ 1 in the f-plane. The concentric circles within the unit circle of the 
3 -plane become eccentric circles about -f 1 in the f-plane, the unit circle 
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itself correspx)nding to the imaginary axis of the j'-plane. The radial lines 
within the unit circle of the c-plane become those i)ortions of the orthogo- 
nal circles through the points dbl which lie in the right half of the f-phme. 
The circles of the c-plane (shown dotted) which are larger than the unit 
circle become eccentric circles in the r-plane about the point —1, and 
the portions of the orthogonal circles through the |K)ints f = ±1 which 
lie in the left half of that plane (also shown dotted ) corres|)ond to those 
portions of the radial lines in the :3-plane lying outside the unit circle. 



(a) (b) 

Fig. 33. Map of the interior of the unit circle in the c-plane to the right-half s'-planc 
by the transformation c - (I — s'' (1 ^ 

A more comprehensive view of this situation is gained through recog- 
nition that the points -f 1 and — 1 of the f plane are transformed into the 
origin (south pole) and the p>oint oc (north pole), respectiv(‘ly, of the 
complex sphere associated with the s-plane. In other words, the poles of 
the complex sphere associated with the 2 -plane are transformed into the 
finite points ±1 of the f-plane, and the resulting map in that plane 
represents the corresponding distortion suffered by the concentric circles 
and radial lines in the s-plane which accompanies this shifting of the 
poles. This interpretation is responsible for the term “ bipolar circles ” or 
bipolar plot by which the map in the f-plane is also known. 

Since the forms 523 are symmetrical in the variables z and f , it follows 
that the sets of orthogonal loci in the 2 - and f-planes of Fig. 33 may be 
interchanged. That is, if concentric circles about the origin together with 
the family of radial lines are drawn in the f-plane, the corresfxjnding map 
in the 2 -plane becomes that which in Fig. 33 is shown for the f-plane. 
In other words, the transformation 523 may alternatively be said to map 
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the inside of the unit circle in the f-plane uix)n the right half of the 
s-plane. 

A .vord of caution may be appropriate at this point in order to guard 
the reader against confusing part (b) of Fig. 33 with the similar appearing 
plot of Fig. 32. In the latter, the poles Zi and z-y are the fixed points of the 
transformation. Figure 33(b), on the other hand, is not the corresponding 
plot for the transformation 523, since the fixed jx)ints in this case are 
evidently not the joints f = ±1. The maps in Fig. 33 are merely a pair 
of corresix^nding sets of curves chosen to illustrate the way in which the 
region enclosed by the unit circle is mapped upon the right half plane, 
and these curves have nothing in common with the type of plot showm in 
Fig. 32. The reader may further clarify his thoughts on this score by 
determining the fixed i^oints of the transformation 523 and by subse- 
quently drawing loci of the type shown in Fig. 32. 

In connection with the preceding general discussion it is relevant to 
add the following remarks regarding several collaterally useful properties 
of the linear fractional transformation. It should readily be appreciated, 
for example, that the end result of carrying out two different linear 
fractional transformations in succession can also be had from a single 
one. More siK'citically, if the relation 


-f- bi 
CiZ -f- di 


[525] 


represents a transformation from the variable to a new variable z\ 
and 


O’iZ * 4 " b'2 
+ dy 


[526] 


represents a succeeding transformation to -nil another variable z'\ 
it is always {possible to relate to c directly oy a transformation of the 
same form, namely. 


az T b 
cz d 


[527] 


The truth of this statement follows immediately from the consideration 
that the linear fractional form carries circles over into circles and con- 
versely that any univalucd analytic transformation of circles into circles 
is linear fractional (sec footnote on page 375). Thus, two such trans- 
formations applied in tandem obviously accomplish no more than can be 
accomplished by a single one.* 

^Because of this property the totality of possible linear fractional transformations is said 
to form a group. 
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The coefficients a, b, c\ d in the single resulting transformation 527 may 
be determined from the coefficients in the two separate transfonnations 
525 and 526, through substituting the relation 525 for c' into Eq. 526 and 
putting the result into the form of Eq. 527. One finds that 

a = (Lydi 4 - boCi 

b = a 2 b I -f b od I 

[528] 

C — C‘2(Ji “T doCi 
d = Cobi -t- dodi 


It is interesting (as well as relevant in dealing with circuit problems 
involving a cascade of transmission networks) to observe that the rela- 
tions 528 may be expressed more compart ly by means of the single 
matrix equation 


[ a b __ r«?2 ^ ^1 

C dj~ Lf2 d-i] ^ Lc di_ 


[529] 


Another interesting fact is brought out when the linear fractional 
transformation, Eq. 488, is put into the form 


r = 



[530] 


as is always possible so long as the conditions 489 and 490 are fulfilled. 
In view of this form, one may regard the arbitrary linear fractional 
transformation as equivalent to carrying out in succession the simpler 
transformations indicated by 


z = z-V - 
c 

[531] 


[532] 


[533] 


The first of these component transformations represents a displacement, 
the second is a simple inversion, and the third represents a rriagnification 
and rotation (multiplication by a complex constant) followed by a dis- 
placement. Since each of these component transformations carries circles 
over into circles, one arrives again at the conclusion that the linear 
fractional transformation has this property. 
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Finally, an interesting^ correlation with the process of stereographic 
projection may be mentioned . As pointed out in Art. 4, this geometrical 
process is one whereby every finite point in the complex z-plane has 
uniquely associated with it a {xiint on the so-called complex sphere which 
is tangent at the origin. If a second plane (f-plane) is imagined to 
be tangent to the .same sphere at some other point, and if the same 
geometrical process is used to as.sociate points on the sphere with points 
in the second plane, it is evidently possible to state that the system con- 
sisting of the two planes and the sphere, together with the geometrical 
process of stereographic projection, enables one to associate uniquely a 
|X)int c in one of these planes with a corresponding point f in the other 
plane. 

It is of interest to show how the transformation implied by this process 
can be expressed analytically in terms of the linear fractional function 
488.* Although the converse of this .statement is not generally true, it is 
worth noting that some of the linear fractional transformations en- 
countered in practical problems may be interj^reted geometrically in this 
simple fashion. 

In order to develop the appropriate analytic relationships, one may 
begin by visualizing a sphere of diameter D and any two tangent planes 
(the z- and f-planes). I'he two points of tangency determine uniquely a 
great circle on the sphere. The plane containing this great circle is con- 
veniently chosen as a cross-sectional plane in which to indicate graphically 
further pertinent geometrical relationships. P'igure 34 shows the system 
as viewed in this plane, to which the and f-planes are both orthogonal 
and hence appear as lines. I'he two polar axes, which are diameters of the 
sphere and are normal to the and s'-planes at their respective points 
of tangency, make an angle with each other which is denoted by 7. 
The real and imaginary axes in the and s'-plan.:s have orientations which 
at the moment may be considered to be arbitlar^^ 

The points in the f-plane corresponding to c = 0 and 3 = x are 
denoted by fo and f*, respectively, whereas those in the 3-plane cor- 
responding to f = 0 and f = x are denoted by 3o and 3». These four 
points lie in the plane of the polar axes, and their locations in the 3- and 
f -planes are shown in Fig. 34. From the geometry it is readily seen that 

Izol = Ifol =Z?tan^ [534] 

*Thc possibility of doing so should be evident from the geometrical proposition that the 
process of stercographic projection carries circles in the plane over into circles on the sphere, 
and vice versa. Hence a circle in one of the planes is transformed into a circle in the other 
plane. Any univalued analytic transformation which carries circles over into circles is expres- 
sible in terms of a linear fractiomil function. 
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and 


I=.l = is-.i = z^cot 2 [535] 

in which the absolute value sii^ns are necessary because the lines repre- 
senting the c- and {‘-planes in Fig. v^4 are not necessarily coincident with 
the real axes in these planes. 



Fig. 34. In many cast-s, a linear fractional transformation may be visualized as 
a twofold stereographic projection in terms of two planes tangent to the siime complex 

sphere. 


With reference to the linear fractional function 488 and its inverse as 
given by Eq. 492, it is expedient in the subsequent considerations to 
observ'e that only three of the four constants a, b, c, d are independent, 
since the right-hand members of these relations have the same value if 
their numerators and denominators arc multiplied by a common (finite, 
nonzero) factor. Inasmuch as it is assumed that c 5 *^ 0, one may choose the 
constant 1/c as this common factor, or, what amounts to the same thing, 
arbitrarily set c equal to unity. Equations 488 and 492 then yield the 
results 

Zo = — ^ = -d [536] 


and 


fo = 


b 

d 




[ 537 ] 
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Comparison with Eqs. 534 and 535 shows that 


\„\ = cot ^ [5381 

and 

|6| = Z)2 [539] 

Rcfcrrinj^ to Fig. 34, one recognizes that the angles of the complex 
quantities Zq and (in the complex c-plane) must differ by x radians 
and that the same is true for the quantities fo and f *. In view of Eqs. 
536 and 537, this fact yields the conclusion that 

must be a negative real number [540] 

This result, together with the condition that jal = represent the 
restrictions which must be im|X)sed ufx)n a given linear fractional function 
in order that it may jK)ssess the simple geometrical interpretation dis- 
cussed here. For any given linear fractional function fultilling these con- 
ditions, the diameter of the sphere and the angle y between the polar 
axes are given by Fejs. 538 and 539, whereas the orientations of the real 
and imaginary axes in the c- and ('-planes are obtained from Eqs. 536 and 
537, 'riuis the corresponding geometrical configuration is determined. 

Conversely, if the geometrical configuration for the sphere and the tw^o 
tangent planes is jdven, and one wishes to find the corresponding linear 
fractional function, three of the relations contained in Eqs. 536 and 537 
may be used to determine the complex constants u, /), and d. The magni- 
tudes of these constants must, of course, agree with the values given by 
Eqs. 538 and 539. This determination is always possible. 

As an illustration it is interesting to consid.r the transformation 523 
by means of which the interior of the unit circle in either plane (:; or f) is 
mapped ufx)n the right half of the other plane. By inspection of Eqs. 523 
one sees that 

a=— 1 &=1 d=l [541] 


and also that 


Zo= I = -1 

fo = 1 r- = ~1 


These values are consistent with Eqs. 536 and 537 as, of course, they 
should be. From Eqs. 538 and 539 one obtains 



[ 543 ] 
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The transformation in (luestion is, therefore, given geometrically 
(according to the present interpretation) by means of a sphere of unit 
diameter and a pair of tangent planes which are normal to each other as 
well as normal to the plane containing the jxdar axes, riiis configuration 
is shown in Fig. 35. Although the values 542 uniquely determine the 
relative orientations and |X)sitive directions for the real axes in the c- and 
i'-planes, the jx)sitive directions for the imaginary axes are not uniciuely 
lixed, because all these values are real. If the imaginary axis for the 

c-plane is assumed to j)oint vertically 
upward from the ])lane of the paper in 
Fig. 35. and the unit circle in the 
::-plane is traversed in the counter- 
clockwise direction so as to keep the 
interior on the left, then, since the im- 
aginary axis in the must be 

traversed in such a direction as to 
keep the right half j)lane on the left, 
one obserxes that the imaginary axis 
for the ('-plane jx)ints vertically into 
the plane of the pa[XT in Fig. 35. 
This choice for the {Kxsitive directions 
of the imaginary axes places the top fac'es of the c- and j'-planes oi>- 
posite the sphere. It is alternatively jx)ssible to assume both |x)sitive 
directions for the imaginary axes reversed, whence the top faces of the 
planes appear adjacent to the sphere. 

To anyone who has developed a facility for visualizing geometrical 
objects in three dimensions, this interpretation for the transformation 523 
affords a useful means for correlating its various detailed characteristics. 

25 . A MORE GENERAL MAPPING FUNCTION; THE SciIWARZ- 
Christoffel formula 

The principal field of u.scfulness with regard to conformal mapping is 
found in connection with problems in potential theory. When the geome- 
try of the physical system exhibits longitudinal uniformity in one of its 
dimensions, the problem reduces to a two-dimensional one. As shown in 
Art. 5, the real and imaginary parts ii and v of a function of a complex 
variable satisfy Laplace's equation in two dimensions and hence represent 
potential functions. Since the loci for n = constant and v = (onstant 
form orthogonal families, they may be regardecl as tlie equijxitential and 
flow lines of a nonturbulent field, that is, of a pure [xitential field. If a 
pair of such functions can be found whose loci conform to the geometrical 
boundaries of a given physical system, the solution to the boundary 
value problem for that system is thereby given. 



Fig. 35. Visualization of the trans- 
formation s = (1 - ( 1-h k ) as a 

twofold stereographic projection. 
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For example, Poisson’s integral, Jvp 323 or Eq. 324, yields the rx)tential 
inside a circle in terms of specified boundary values. This integral, there- 
fore, constitutes a solution to the boundary value problem for the circle, 
the specific forms given by F^qs. 332 and 333 expressing this solution as a 
sum of exjx)nential functions. 

By means of a function which conformally maps the interior of a circle 
ujx)n a region having a dilYerent geometrical configuration, the boundary 
value problem for that configuration may be transformed into an equiva- 
lent one for the circle, and the desired solution is found through applying 
to the solution for the circle, the inverse transformation. 

The boundary is commonly an equipotential locus, so that the specifica- 
tion of boundary values amounts merely to stipulating that the potential 
function shall be constant on the boundary. In electrical problems, this 
boundary is the surface of a conductor or of a material having a large 
permeability relative to that of the surrounding medium. The solution 
to the boundary value problem for the circle then consists simply of 
concentric circles for the equipotential loci, with the orthogonal radial 
lines representing the Ilow field. The origin, or center of the circle, and 
the ix)int at infinity represent the source and sink for this field. 

An even simpler geometrical configuration is given by a pair of bounda- 
ries in the form of parallel lines. When these are constant potential loci, 
the How map consists of a rectangular grid of fiow lines and equipotential 
lines. Such a rectangular region is mapped upon the interior of the circle 
by means of the function 

z — or t — \n z [544] 

Writing 

c = and / = ;;/ -f jft .545] 

one finds from Eq. 544 that 

m = In r and ;/ = <;> [.546] 

If the origin (r = 0) and infinity (r = oo) in the c-plane represent the 
source and sink for the field, the radial flow enclosed by a pair of lines 
0 = 01 and 0 = 02 (these may be 0 = 0 and 0 = Iw) is mapped upon 
the /-plane as a parallel flow enclosed by the straight lines n = 0i and 
n = 02. The equiix)tential loci, which in the :;-plane are the circles 
r = constant, become the straight lines m = constant, which are at right 
angles to the flow lines n = constant. Source and sink become the points 
m = -t These matters are illustrated in Fig. 36. The rectangular flow 
map in the /-plane may be regarded as a reference field (a flow map of the 
simplest type) to which the radial type of flow map is transformed by the 
function 544. 
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InasmiK'h as the upper half of the c-j^lane may hv eoufonnally mapped 
upcm the upper half of the f-plane, aeeordiiii: to the discussion of the 
previous article, with any two desired }>oints'^ on the real axis of one 
half plane corresponding to the origin and inlinity in the other, it follows 
that a tlow map in the upjxT half plane, with any two |X)ints on the real 
axis designated as the source and sink, may be reduced to the reference 
held in the /-plane of Fig. M). If transformations can be found which map 
regions having other geometrical conligurations for their boundaries, 
u{X)n the upper half plane, or u|K)n the region enclosed by the unit circle, 
a way is established for also reducing the how maps for these configura- 
tions to a simple rectangular reference tield, thus making jx)ssible the 
solution to boundary value problems in these more complicated cases. 



Fig. 36. Transformation of a radial flow map to a simpler ilow pattern by the 

transformation z = 

An extremely useful map{)ing function, of considerable generality in 
its ability to meet various geometrical configurations, is given by the 
so-called ScJni'arz-ChristoJfel formula, which reads 

u<(z) = M r (f - • • -(r - s„r'^ df + at [547] 

Jzq 

Here f is a running variable in the s-plane, Z\,Z2,-‘‘ Zn are « finite points 
on the real axis, numbered in such an order that 

Si < S2 < • • • < Z„ [548] 

and the quantities mi, M2, • • • M« appearing in the exponents are any set of 
positive or negative real numbers. The constants M and N may have 
complex values, with the possibility that iV be zero, but M must, of 
course, have a nonzero value. The lower limit So of the integral is an 
arbitrary point in the upper half plane. It may be chosen equal to zero, 
or equal to one of the points Zi • • • z„. The principle guiding this choice is 
best seen from the illustrative examples given subsequently. 

*A third point (for example, midway between the other two) may be chosen to correspond 
to the point -hi (which lies between the origin and infinity). 
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The independent variable for the mapping function w{z) is the upper 
limit of the integral. For this reason the derivative of the function is 
given by 

^ = Vl/(2 - 0, )-"■ (0 - 02)-^ • • • (Z - Zn)-*^ [549] 


as may be seen from the fact that if one has 

Ji’(s) = jf /(f) 

the usual definition for the derivative 



yields 

w{z + Ac) - = jf /(f) (ff 


[550] 

[551] 

[552] 


Since is a small displacement (becoming zero in the limit), one may 
say that for the integration in Kq. 552 the function /(s') is essentially 
constant and equal to the value j\z). It is assumed, of course, that the 
function /(s') is continuous in the vicinity of the point f which is a 
recognized condition for the existence of the derivative in the first place. 
With /(f) equal to the constant value/(:;), it may be placed in front of the 
integral sign, and Eep 552 yields 

d^=f{z)Az [553] 


the approximation becoming exact in the limi; Ac— >0. Completing the 
limit, one finds, therefore, that 

[554] 

The essential character of the function u'{z) may now be recognized 
from a study of the behavior of the derivative 549 in the vicinity of the 
points c = Zv. The first step in this direction is to represent the factor 
(z — c,.) in the polar form as illustrated in Fig. 37. This representation 
reads 

(z - z,) = |z - [-555^ 

in which k is an integer. 

Then 


(z - z,)-*^ = |z - 


[556] 
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Since the quantity is not necessarily an inteLrer, the ri^ht-hand side of 
Kq. 556 may have many different values for different integer values of k. 

In order to remow this multivaluedness of 
the factor i:; — c, it is specified at the 
outset that k shall assume only the value 
zero. 'I'his s|K‘cili('ati(Ui is ecjui valent to 
stating that the function die dz is to he 
studied on only one of the many leaves of 
its Riemann surface, namely, on that one 
which corres|'K)nds to k 0 in Kc|. 556. A 
typical factor in I‘lq. 549 then becomes 

(C - ^5571 



Fig. 37. Roprosontation of 
z - z. in polar form in the 
Studv of dli' dz. 


and if the jx)int c is allowed to lie only in the uj)per half plane or on the 
real axis of the c-plane, it is clear from Fig. 37 that 


When the polar forms 


Qi’ ^ TT 

[558] 

M 

[559] 

- — 

■ M 

[560] 

* • * Bnd>n 

[561] 


and 


are introduced, it follows that 

d = a — Ati0] 

It is now assumed that the variable c in the function 549 is restricted to 
real values only; that is, the variable {x)int c is thought of as moving along 


2 -plane 



Fig. 38. The path along which dw dz is stufliecl in the Schwar/.-ChristofTcl trans- 
formation. 


the real axis from — x to x , the oijly deviation from this behavior occur- 
ring wherever the variable point z encounters one of the critical {x>ints 
There it makes a slight detour around the critical |x)int instead of pass- 
ing directly through it. These detours may be visualized as having the 
form of vanishingly small semicircular arcs lying in the upper half plane, 
as shown in Fig. 38. As the point z traverses a small semicircular arc in 
the vicinity of the point Zy, the angle <t>v changes from the value tt to zero, 
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whereas the angles of the remaining fac tors do not change at all because 
of the assumed vanishingly small radius of the semicircular detour. 
Hem e for the range 

— 1 ^ z ^ [^562j 

one has 

4>\-<P'i= ••• =0p.-i=O <^p+i=<^p+2= ••• =0n = 7r [563] 

and* according to Ec^. 561 

o— /ip-4 1 + • • • ^ 0 ^a— (/ip+i-f iUp+2H-‘ * *+Mn)^ [564] 

4'hroughout the range 562, the angle 0 is, therefore, increased by the 
amount 

AO = fJLi^TT [565] 

the important feature being that this increment occurs only as the point 
traverses the small semicircular arc. In other words, as the point z moves 
along the real axis, the angle 0 remains constant as :: proceeds from one 
of the critical ix)ints to the next, receiving a sudden increment AO = 
only as z passes directly over the critical point Zy. 

According to the discu.ssion of conformal mapping in Art. 2, it is recog- 
nized tliat the map of the function w{z) in the ic-plane, correspond- 
ing to the real axis in the ::-plane, 
consists of a succession of straight- 
line segments between the points xc’i, 
iCj, • • • corresix)nding respectively to 
-i, -jc • • ) the angular directions of two p. 
consecutive segments confluent in the ' 
jx^int tc,. dilTering by g,7r. That is, the 
map in the a'-plane of the function 
547, corres|X)nding to the real axis in 
the c-plane, traversed from — to x , _ 

has the general character shown in ® 

Fig. 39. 41iis result follows from the t-u • i 

tact, |X)mted out in Art. 2, that the of the real axis in the z-plane shown 
angle of diu dz equals the difference be- in Fig. 38. 

tween the angles of the increments dio 

and dz, and since the angle of the latter remains zero as the point z 
moves along the real axis, the angle of diu dz must equal that of duK 
This angle, however, is shown to remain constant except when z passes 
over one of the critical quantities Zy. At the corresponding points u\ 
then, the direction of the increment d’w suddenly changes by the 
amount g.Tr. 



If is negative, the inequalities in Etj. 564 are reversed. 



Fiwcrioxs OF A C(U//V./!:.V variable 


(C7/. VI 




The plot in the u-phine eorresix^ndinj^ to the real axis in the c-plane 
is thus seen to be a tx>lygon with the |x)ints u'l • • • as its vertexes. If 

Ml T T ‘ ‘ T M/j = 2 [566] 

the sum of the increments Xd at the ;/ vertexes iCi • • • a'„ equals lir. 
Since Itt equals the sum of the external anodes of a closed fxdy^on, it 
follows that when the condition 566 is fuliilled, the jxilyi^on is //-sided. 
Unless one or more of the values of jui ■ • • /i,, are ecjual to or greater than 
unity, all the // vertexes of the {ndygim lie at finite points corresjxuiding 
to the finite values -i • • • z,, on the real axis of the ::-plane. The jioint at 
infinity in the c-plane corres|xmds to an ordinary jxiint on the straight- 
line segment joining the vertexes ic ., and iCj. If a factor has a {xisitive 
value equal to or greater than unity, the corrcsjxinding vertex of the {xily- 
gon in the :c-plane lies at infinity. This circumstance is subsecjuently dis- 
cussed in detail. 

When the condition 566 is not fulfilled, the jxu'nt at infinity in the 
c-plane also corresjxmds to a vertex, and the jxilygon in the tc-i)lane has 
;/ -I- 1 sides. In order to consider this jx'is.^ibility more fully it is neces- 
sary first to study the function iciz) in the vicinity of the |X)int c = . 

For this purpose it is convenient to make the substitution 


= > with z*y = 


-1 


Then 

and 


(f - Zy)~^^ = • (s'* - 

so that Eq. 547 takes the form 


= M f 
in which 


(f* - • • (f* - A)' 


[567] 

[568] 

[569] 

di* + N [570] 


At 



[571] 


The derivative reads 

dw ■ • • ( 2 *„)'‘"( 2 *)<'“+« + -" +'‘"- 2 ' 

d2* ~ (2* - Z*xYKz* - Z*2r • • • (z* - Z*nT" 


[572] 


The point 2 = <» corresponds to s* = 0. For the vicinity of z* — 0, 
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Eq. 572 gives 


dw 

d~? 




+Mn-2) 


[573] 


Hence it is clear that the point s = oo yields a vertex at which the angular 
iiuTement AO has the value 

A0= (2 — Ml — /io — • • • — [574] 

d'his vertex also lies at a finite point in the ic-plane unless the quantity 
(2 — — /ij — • • • — yin) ecjLial to or larger than unity. It is obvious 

that the vertex does not exist if the condition 566 is fulfilled. 

In order to study the behavior of the function zc in the vicinity of one 
of its vertexes, one may first write for the derivative 549 

— = fc — ~ ~ -!“•••] [575] 


in which the bracket expression is a Taylor expansion for the function 
549 with the factor (c — missing. For this Taylor expansion, which 

is evidently jKrssible because the function it represents is regular at the 
point the coeflicient </(, is certainly not zero. Hence assuming 

y^ < 1 the term by tenn integration of Eq. 575 yields 


. = r... = 


+ : — ; r 


+ 1 


+ 2 


+ ti-, [576] 


For the immediate vicinity of the vertex at the point w = 'w,, therefore, 
one has the representation 


lu ^ + 


ao(2 - 

1 - yi. 


[577] 


This analysis shows that so long as < 1, the function w{z) is regular 
in the vicinity of the vertex This v’ertex may hav^e a v^ariety of ap- 



pearances depending upon the particular value of < 1. Some of these 
are shown in Fig. 40. The arrows on the line segments indicate the direc- 
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tion in which the contenu* of the polyjxon is traxersed as the point c 
travels in the |xisitive direction ttrom — ^ to \- ) along the real iLxis 

of the c-plane. I'he region enclosed hy the polygon, which is that on the 
left of the contour, is shown shaded. 

Part (a) of the hgure shows the ai)pearance of tlie vertex when 
0 < < 1. Here the external angle has a positive value between zero 

and TT. Part (b), for which the external angle has a negative value be- 
tween zero and — tt, corresponds to — I < < 0. Part (c) shows the 

vertex for ~ 1. Here the external angle ecjuals — tt. Finally, jxirt (d) 
illustrates the appearance (d* a vertex at which the external angle is less 
than —TT (that is, equal to the negative of a (juantity which is larger 
than tt). In tliis case the enclosed region involves a double-mapped 
portion. 

If /i, = 1, Eq. 575 yields 

— = ao(s — z»Y~^ + ^1 + ^12(2 -v) + * * • 

az 

and the integration then gives 

vj = J* dw = do In (2 — + ai(z — Zv) + ~(s — z,,)“ -}-••• -f C [579] 

in which 

C — "”^0 (^0 ““ • • • [08O] 

In this case the vertex evidently lies at infinity, since the value of the 
right-hand side in Eq. 579 becomes infinite for c = Zy. Figure 41 shows 
how this vertex may be imagined to appear, if 
it is assumed that one is pennitted to indicate 
infinity as a finite [X)int. The enclosed region in 
this vicinity is null, since it is contained be- 
tween line segments which fall upon each 
other. In the finite region of the cc-plane the 
appearance of a polygon with a x’ertex tcv of 
this sort at infinity is shown in Fig. 42. The 
vertex of Fig. 41 at infinity is seen to result fiom a j)air of confluent line 
segments which are parallel and hence meet or intersect at infinity. 

When Ht, has any value larger than unity, the corresponding vertex 
also lies at infinity. Figure 43 shows the appearance of a jxilygon with 
such a vertex. The external angle lies between tt and It, and g*, has a 
value between 1 and 2. F’or nu = 2 the external angle equals Iw. Analysis 
similar to the preceding then shows that the function luiz) has a simple 
pole at the jxiint z = in addition to having a logarithmic infinity there. 

In the light of the preceding discussion and interpretation it may now 





Fig. 41. Appearance of 
the vertex when mp - 1. 
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be stated by way of a surimary that the function defined by the integral 
547 uniquely and continuously maps all points on the real axis of the z- 
plane upon the boundary of a [X)Iygon in the Zt^-plane. Since its derivative 
as expressed by Kqs. 549 and 572 is regular at all points in the upper half 
of the c-plane inclusive of the {xunt at infinity, it follows that this func- 
tion is regular and continuous throughout this entire half plane and is 
single-valued by virtue of the stipulation regarding the choice of values 
for the factors It may be concluded, therefore, that the 

function also maps the entire upper half of the s-plane upon the region en- 



Fk;. 42. Af)pcanina‘ of the polygon 
in the finite te-plane when one of its 
vertexes corresponds to mv = 1. 



Fig. 45. Appearance of the polygon in 
the finite ic-plane when one of its 
vertexes corresponds to > 1. 


closed by the |X)lygon in the 2c-plane, for any closed boundary lying wholly 
inside the }X)lygon must by reason of the continuity and single valuedness 
of the function also lie wholly within the upi er half of the s-plane. The 
availability of this mapping function and tliat represented by the linear 
fractional form discussed in the preceding article greatly enhances the 
means for soh ing two-dimensional boundary value problems, as is illus- 
trated by the following examples. 

The first i^roblem to be discussed is the determination of the field dis- 
tribution in the vicinity of the edges of a parallel-plate condenser. Since 
the distortion of the field fringing is confined to the more im- 
mediate vicinity of the edge, the plates may be assumed to be infinitely 
wide, and since the field distribution is symmetrical about a center line 
between the plates, it is sufficient to map the field on one side of this center 
line only. The region over which a field map is desired may, therefore, 
be sketched as shown in Fig. 44. The edge of the condenser plate is at the 
ix)int C above the origin of the xp-plane. The plate itself is parallel to the 
real axis of this plane and extends infinitely far to the left. The real axis 
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represents the center line between the platen , so that the distance d 
equals half the spacing between them. 

The shaded area, throughout which the field map extends, is regarded 
as the region enclosed by a polygon which has three vertexes. One of 

these lies at the finite f>oint L\ and 
the other two lie at infinity. One of 
these latter two results from the 
region J with its jiarallel boundaries 
extending to infinity on the left. 'I'his 
one, which ma.y be designated as 
“ vertex .1.** ex idently has the char- 
acter ot the vertex at intinity for the 
jx)lygon of Fig. 42. fhe other vertex 
at infinity, which may be designated 
as vertex B, " has the character of the vertex in Fig. 44 for = 2. 
It represents the infinite extension of the region B to the right and upper 
left of Fig. 44. 

The electric tlux being considered as the tluid. the tlow in the jiarallel- 
plate condenser is from one of its jdates to the (Uher. whereas the e(jiii{K)- 
tential loci are the orthogonal curx'es symmetrically grouped alxnit the 
center line. However, from a mathematical standixiint it is e(jually 
admissible to consider the tlow in the sj)ace between the plates to be in 
the general direction along the center line, with the ecjuipotential loci 
extending from one plate to the other. In other words, the tlow map 
consisting of orthogonal families of loci depends only u|)on the geometrical 
configuration of the boundaries, and hem e it makes no difference which 
of the families of curv'es is thought of as representing the flow of a jdiysical 
fluid. In the present instance it is convenient to think of the tluid as 
flowing sideways, with the plates as longitudinal boundaries and the 
vicinity of the edges as the throat from which this tluid issiu's into the 
entire surrounding space. .According to this jx)int of view, the vertex A 
becomes the source and the vertex B becomes the sink. 

If the region enclosed by the polygon of F'ig. 44 is now map{Xid iqxm 
the upper half of a c-plane in such a way that the origin and the ix)int 
at infinity for this plane are identified respectively with the vertexes 
A and B of the polygon, the equivalent flow map in the c-plane is simply 
given by the concentric circles about the origin (equiix^lential loci) and 
the radial lines extending from the origin to infinity (flow lines). The 
vertex at the point C in Fig. 44 is then represented by a finite i)oint on the 
real axis of the z-plane. 

In view of the fact that the points on the real axis of the c-plane cor- 
responding to the vertexes of the polygon must be arranged in such an 
order that they are encountered in the same sequence during a traversal 





/, '' 

cb ;- ' 



w;-plane% 

y 



' ^ 




C 

\ a '- 

'/V 

cehte 

r line 

0l 

\ 


Fig. 44. Relevant to determining the 
field of a parallel -plate eonden.-^er. 
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of the real axis in the positive direction (leaving the upper half plane to 
the left ) as are the vertexes of the {X)lygon when its boundaiy^ is traversed 
in the corresponding positive direction (leaving the enclosed region to 
the left), it follows that the iK)int on the real axis of the c-plane cor- 

resj)onding to the vertex C must lie to the left of the origin as indicated 

in Fig. 45. 41iis is the point designated a> 
c = Cl. Since thi> vertex is like the one 
shown in part (c) of Fig. 40. the corres- 
ponding exponent /ii has the value —I. 

4'he vertex .4 , which corresj>onds to c = 0, 
is like the one shenvn in Fig. 41. I'he 

M-value for this vertex, therefore, is H-l. 

-As pointed out in the j)receding discus- 

sion, a vertex which cc^rresponds to the 
point c = is not rcpre.sented by a factor of the form (f — 
in the integral 547 but comes about by virtue of the behavior of this 
function in the vic inity of the point at infinity. In other words, the cor- 
responding factor is implicitly rather than explicitly contained in the 
general form of the integral, which for the present problem is now recog- 
ni/.ed to read 


I 

z -plane 



Fig. 45. Contour in the z-plane 
for the problem of Fig. 44. 



di 

--.r'ls- - o)> 


+ .Y 


[ 581 ] 


Thus for the vertex B (^corresixinding to z — «>) one has, according to 
F]q. 574, 

Aifl = (^2 + I — l)7r = 27r [582] 


•which checks with the abo\’e statement that this vertex has the character 
of ii\. in Fig. 43 for /i,. = 2. 

Equation 581 may more appropriately be »vritten in the form 





zx)d^ 

s' 


+ 


[583] 


It is now convenient to choose the lower limit Cq equal to Si, corresponding 
to the vertex C' in Fig. 44. Here tz’ must have the value jd, and since the 
integral in Eq. 583 vanishes for c = Co = Si, it follows that 

X^jd [584] 

The constant M may be evaluated through calculating the increment 
in the function w correspemding to the increment in s resulting from a 
traversal of the semicircular detour about the point s = 0. For this 
semicircular path one may write 


[ 585 ] 
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in which p is the radius of the small arc. For small v alues of ^ the factor 
(v Cl ^ in the integral (^f !*.(]. 5S.> may he rej)laced hy — Ci, aiul inasmuch 
as Eq. 585 yields 


ill 

s* 


= j d(f> 


[586] 


it is seen that the increment in the function corresjxmding to the arc 
increment in :: becomes 


Aic 


= —ZiMJ d<p = jirZxM 


[587] 


This increment in ic must equal the change in the value of u* which cor- 
responds to passing around the vertex A. With reference to Fig. 44 one 
sees that this change ecpials the increment in :c between the bottom face 
of the condenser plate and the center line, and hence that this increment 
in IV must equal —jd. 'Finis 

A;c = -Jd =jTrz,M [588] 

whence 


4/ = - 

TTZi 

The representation for the function Xc- now reads 

d r^(i--zj)di: 

__ jd 


“--7' 

TTZi Ozi 


The integration, which is straightforward, yields 

li' ^ ~ ^ J + 


[589] 

[590] 

[591] 


Since there are no further conditions to be satisfied, it appears that the 
location of the point Si on the negative real axis in Fig. 45 may be chosen 
arbitrarily. A choice which yields a simple form for the resulting mapi)ing 
function is Zi = —1. Since in ( — 1) = yV, Eq. 591 becomes 

w; = ^ [1 + s + In s] [592] 


This is the desired mapping function which converts the family of 
concentric circles about the origin of the 2 -plane and the orthogonal 
family of straight lines radiating from the origin (both confined to the 
upper half plane) into the orthogonal families of loci which map the field 
and the equipotential lines for the condenser plate in F4g. 44. If desired, 
this map may be converted to the rectangular reference form shown in the 
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/-plane of Fig. 46 through making the further change of variable given 
by Eq. 544. Then Eq. 592 becomes 

tc' = —[1 T / -f- 

TT 

The reference field in the /-plane, with several points of particular 
interest marked on it, is shown in Fig. 46. The edge C of the condenser 
plate in Fig. 44 where w = jd corresfxinds to / = yV. The under side of 
the condenser plate in Fig. 44 corresponds to the portion of the horizontal 
line n = /V to the left of C; the 


toj) side of the plate corres|X)nds 
to the {K)rtion of this line to the 
right <d‘ C’. 1'he center line in Fig. 
44 is given by the real axis of the 
/-f)lane. 'Die transformation 59.^ is 
.‘^een x’irtually to remove the v>60- 
degree bend in the boundary at 
the edge U in Fig. 44 so that the 
toj^ side of the condenser plate 
appears as a linear extension of the 
terest to note that the origin of Fig 
the |K)int C in the /-plane but is 



Fig. 47. Relevant to plotting the 
How in a right-angle bend. 


^-plane 



Fig. 46. The field of the parallel-plate 
condenser in the /-plane. 


bottom side. It may also be of in- 
. 44 does not appear directly below 
shifted to the negative real point 


Z'plane 



O A P 


Fig. 48. The contour in the 
z-plane for tlie problem of 
Fig. 47. 


I = — 1.279. The process of transferring the rectangular grid of lines in 
the /-plane of Fig. 46 to the region of interest in Fig. 44 by means of the 
function 593 is left as an e.xercise for the reader. 

As a second example let it be required to determine the field map for 
the flow around the right-angle bend indicated by the shaded region in the 
7 e-plane of Fig. 47. The polygon in this example has four vertexes. One 
lies at the origin 0, another at the finite point P, and the remaining two 
at infinity. Both these have the character of the vertex of Fig. 42. 
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They are here denoted as the vertexes A and />. 'I'hev heroine the source 
and sink respectively for the present problem. 

The points on the real axis of the ::-plane which are chosen to cor- 
respond to these vertexes are indicated in Fi^^. 48. 'Fhe source vertex .1 
is assumed to lie at the (wigin and the sink vertex B at infinity. 'Fhe 
proper order, as determined from the corres{>onding directions of traversal, 
requires that the vertexes O and P lie respectively to the left and to the 
right of the origin. Fhese arc designated as the ]X)ints z = Zi and c = cj. 

Recognizing that the exponents yu,. for the vertex{‘s f). A, P are re.'ipec- 
tively -fw, — o, and choosing the lower limit Zq of the integral 

547 equal to Zi, one has 




0)'(j--=,) 


-1/2 d" ^ 


[594] 


or 




' A- 


[595] 


Since the origin of the :c-plane c(wres|Kmds to the jxu'nt c = Ci, one has 
li'izi) = 0, and hence Fq. 595 yields 

= 0 [596] 


The reason for choosing the lower limit of the integral as the point z — Zi 
is thus evident. 

In order to study the functiiin in the vicinity of the ve rtex B which 
occurs for z ■= , it is necessary to make the change of variable 

^ [597] 


whence 

f f* 


[598] 


The vicinity f = « is identified with the vicinity = 0. 7Te increment 
in the function w resulting from passing around the vertex B at jT* 0 
is determined through considering the integral 595 for f — > and making 
the substitutions 597 and 598. This gives 

A'u; - ■ [599] 


in which the integration is to be extended over a small semicircular arc 
of radius p about the origin of the f*-plane. For this arc 

f* = pe’* 


[600] 
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and 


■A.u 

-pr 


SO that the intep^ral 599 becomes 

Atc = “7*^/ d4) = jwM 


[601] 


[602] 


But “ passing' around ” the vertex B in the ic-plane amourts to going 
from the right- to the lefl-luind boundary of the cross-hatched region B 
in Fig. 47 and hence yields an increment in the function «:• equal to —di. 
Therefore 


Sci' = —di = JttM 


[603] 


In a similar manner, passing around the vertex A corresponding to 
the vicinity of = 0 yields 

But according to Fig. 47 this must ecpial Jdo. Hence 

jdo = -JttM r./— [6051 

N 2i 

Equations 603 and 605 determine the constant 3/ and the ratio Z 2 2 i. 
Since there are no further conditions to be fulfilled, one of the points 
Zi or ::2 may be chosen arbitrarily. For the moment this choice is left 
open. The ratio is found to be given by 


, d.> z. do- 

=j-f or 

Zi di ...i di 


[606] 


This result agrees with the fact that co is jx^sitive and Zi negative. 
The required function is now represented by 

bU 


w = -- f - ^4=^ d^ 


[607] 


which may be converted into 

J(^i r 
w = — / 

TT t/z. 


f -2 


T Jz, fv"(s- - sOCs*- - = 2 ) 


or 

w 


^jdi r dr r 

r •A. v/({- - 2|)(f - Zr) ^ 'Al 


di: 




fN'(f-=l)(f-S2) 


[608] 


[609] 
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If in the second integral one makes the change of variable indicated l)y 
Eq. 597 and also the consistent changes 


^ _ 1 , -1 
- ’ -1.2 


with the help of Eq. 606, one finds 

. jji r ■'>- + _ 

The integration yields 




K \ I '-SI 

1 ) < S 




1,6 10] 


[ 611 ] 


= ^:^ln 


— + - 2 ) “b ^ ~ Z]^‘^Z — Cj '' I 


^(-1 - -2^ 


do, IZ* - }2iZ*i - Z^j) + \ ^Z^ 

+ — In , - ^ - 

TT [ 2^*" 1 - 2; 


•* - )ir* 


,6121 


Inasmuch as an arbitrary specification with regard to zi or Zo still 
remains open, it is possible, in agreement with E(j. 606, to set 

dj . d-j 


ii2 = 7- and c, ^ - 

do a , 


Then Eq. 612 takes the form 

U' = d, +-^— /.;i -r 

in which 


f(z) = In 


+ ^ // , d,\( _ (/A 

r ' 2,/,d, \V^d,/V dj 


dr + d£ 
2d\d-> 


"6131 


[614] 


[615] 


For purposes of calculation it may be helpful to note that the sub- 
stitution 


2' = 


2V/, ^ 

IM^dA 

2 \di dz) 


[616] 


converts the expression 615 to the form 

J{z') = In (2' + V(2')" - 1) [617] 

If (/j = J2, the resulting mapping function is greatly simplified inasmuch 
as Eq. 616 shows that then 2' = 2. 
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.lie previous example, the field map may be reduced to points 
.cir grid by means of the additional substitution z = 

26. Hiirwitz polyxoml\ls; stability criteria 

In problems dealing with the dynamics of a physical system one is 
fre(juenlly concerned with the question of the stability of its behavior. 
In terms of the so-called characteristic equation of the system, which 
has the form 

fln-” “h * + • * * -f~ fllS + flo = 0 [618] 

with real coefficients (in ■■■ (L,, this question is answered in the affirmative 
if it can lie established that all the real roots and all the real parts of the 
((implex roots are negative. Stated in another way, the stability of the 
physi('al system is assured if it can be showm that all the zeros of the 
|)()lynomial 

P(z) — (2„-” + Uh— 1-^* ^ -f- . . . -j. (jiz -h ao [619] 

lie in the left half of the >plane. A polynomial having this property is 
called a Iluncitz polynomial. 

Idle necessary and sufiicient conditions which the coefficients of an 
arbitrary ixilynomial iwith real coelficients) must satisfy in order that 
it be a Ilurwitz polynomial are sfxiken of as the Hunoitz criteria. In 
dynamics these same conditions are alternatively referred to as RoiUVs 
stability criteria. 

Starting with the factored form of the polynomial 619, which reads 

P{z) = an{z - Zi){z - Z 2 ) ' - {z - Zn ) [620] 

one may readily establish (by multiplying out and collecting terms with 
like jxiw'crs of ::) that 

+ Zn ^ l^n [621] 

a, I 


— = (- 1 )" -2122 •• -271 

(in 

If it is assumed that all the roots are real and negative, and that > 0, 
it is evident that all the coelficients are positive. If some or all of the roots 
arc in the form of conjugate complex pairs, then one may likewise estab- 
lish (by the use of Eqs. 621) that all the coefficients are positive if the 



FUNCTIONS OF A COMPLEX VARIABLE 


\Ch. VI 


3 % 


roots negative real parts. It is thus seen that the ix)sitiveness of the 
^,QP<iricients* is a necessary condition for the negativeness of the real parts 
jf the roots of a given algebraic equation, but this condition alone proves 
to be insurticient, as is shown by the following discussion. 

One begins by investigating the significant properties of Hurwitz 
polynomials. In these considerations two simple facts form the basis 
from which the pertinent conclusions are readily obtained. These facts 
are, first, that the zeros of the jx^lynomial P(z) lie in the left half of the 
c-plane and, second, that their distribution in this half plane is sym- 
metrical about the real axis. The last statement is evident from the fact 
that complex roots must occur in conjugate pairs. 



la) (b) 

Fig. 49. Relevant to the properties of a third degree Hunvitz jwlypomial. 

The polynomial is consiclered to be in its factored form as given by 
Eq. 620. P’or an arbitrary value of the com])lex variable the several 
factors for a polynomial of the third degree may be represented graphically 
as shown in Fig. 49. Parts fa ) and (b ) of this figure show how the graphi- 
cal representation of the factors changes when the algebraic sign of the 
variable is reversed. It is clear that if - is chosen to lie in the left half 
plane fas in part (a) of Fig. 49) — g lies in the right half plane, and vice 
versa. It may also be seen that if is replaced by its conjugate value 5, 
the factors collectively represent the same set of magnitudes since pairs 
of factors representing conjugate roots merely become interchanged. 
This is, of course, also obvious from the fact that P{z) is the conjugate 
of P(z)y and hence these two values of the polynomial havx the same 
magnitude. On the other hand, if the point z is replaced by its image 
about the imaginary axis fthis amounts to replacing z by its negative 
conjugate v^alue), collectively the magnitudes of the factors change in the 
same way as they do when s is replaced by —z. Points which are images 

*It is obvious that one may alternatively state that all the coefficients must be negative, 
the significant point being that they all have the same algebraic sign. 
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with respect to the imaginary axis are regarclecl as corresponding points 
in the left and right half planes. 

ft should be clear from the representations in Fig. 49 that for any point 
z in the right half plane, the magnitude of the polynomial is larger than 
it is for the corres|:K)nding |X)int in the left half plane. Together wdth the 
considerations of the preceding paragraph, this fact enables one to see 
without didiculty the truth of the follow'ing statements: 

'P(s)! > IP(-s)' for/;e(2)>0 

= !P(- 2 )' for A;c( 2 ) = 0 [622] 

\P{z)\ <'P(-z)\ for/ee(2)<0 

in which Re denotes “ real part of.” Letting 

P{z) 

[623] 


one may alternatively e.xpress these results by 

qIzY > 1 for Re(z) > 0 

<PU): = I (or Rc[z) = 0 [624] 

[<t>{z)\ < 1 for Reiz) < 0 


It should be clearly recognized that these statements hold only if 
P{z) is a Hunvitz jx)lynomial, for if P{z) has any zeros in the right half 
plane, ix)ints can certainly be found for which these statements collec- 
tively are no longer true. One may, therefore, conclude that if the condi- 
tions 622 or 624 are true, P(z) must be a Hurwitz polynomial. The rela- 
tions 622 or 024 are the necessary and sufficiefU conditions that a given 
polynomial P{z) have zeros in the left half p ;ine only. By means of the 
succeeding manipulations, these conditions are put into a more usable 
form. 

The first step in this direction is to introduce the function 


4>{z) + 1 ^ P {z) + P{-^z) 
<t>{z) - 1 P\z) - P{-z) 


[625] 


According to the discussion given in Art. 24, this transformation maps 
the interior of the unit circle in the 0-plane upon the left half of the 
0-plane. Hence one has 

Re{i) > 0 for |</.(s)l > 1 

Reif) = 0 for | 0 ( 2 )! = 1 

Rei^p) < 0 for |<;>(s)| < i 


[ 626 ] 
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and, with the use of the relations 624, one obtains 
Re{\p) > 0 for Rc(z) > 0 

{orRe(z) = 0 [627] 

< 0 for Rc(z) < 0 

If the polynomial P(z) is written in the form 

P{z) = m{z) 4- n{z) [628] 


in which wfe) represents the terms involvinjx even powers of (called 
the even part of P) and ;/(::) represents the terms involving odd |X)wers 
of z (called the odd part of P), according to Eep 625, one has 


iiz) 


tn(z) 

H{z) 


[629] 


It is thus established that if m(z) and n{z) are the even and odd parts 
respectively of a Hurwitz polynomial, the rational function 629 has the 
properties expressed by the conditions 627, and, conversely, if the ratio 
of the even and odd parts of a given [x>lynomial yields a rational function 
having the properties 627, that polynomial must be a Hurwitz ixdynomial. 

These properties are now examined in greater detail. Sui){X)se the 
rational function yplz) has a pole of the order s at some point z ~ Zy* 
The Laurent series for 4/{z) in this vicinity then reads 


~ (r z + • • * + ■^Tzzryr + ^v) + 


For points very close to Zy one may write 


'Piz) [631] 

Iw uy) 


Letting 

II 

T 

[632] 

and 

(z - 2 .) = pc’” 

[633] 

one has 

l^(z) ^ 4 

[634] 


p 


whence 

k 

Re{4>) ^ — cos {sa — p) 

[635] 


P 


It is thus seen that, in the immediate vicinity of the pole, the real part 
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of yj/iz) assumes large negative as well as large positive values. More 
specifically, as a is allowed to vary from 0 to lir (the vicinity of the pole 
is explored through passing around it on a concentric circle of small 
radius), the real part of ^(s) is observed to change sign 2s times. 

With reference to the conditions 627, one is forced to conclude im- 
mediately that the function cannot have poles in either the right or 
the left half plane. This fact restricts the poles of ^(s) to lie along the 
imaginary axis of the c-plane. But the conditions 627 together with the 
Va\. 635 impose further restrictions upon these poles. The only conditions 
under which Kq. 635 for a pole on the imaginary axis does not conflict 
with the restrictions 627 are that = 0 and ^ = 1, for then Eq. 635 reads 



According to Eq. 633, Re(z) > 0 corresponds to — 7r/2 < a < 7r/2, and 
R€{z) < 0 corresponds to tt 2 < a < 37r 2, whereas for Re(z) = 0, 
a = ±Tr^2. Equation 636 is thus seen to yield a real part of which 
behaves in agreement with the conditions 627. The restriction that 
5 = 1 means that the ixile must be simple, and (3 = 0 requires that the 
residue of \l/{z) in this pole be real and positive. 

The conditions 627, therefore, require that the rational function \p(z) 
have poles on the imaginary axis only and that these poles be simple and 
have positive real residues. In order to see that these requirements on the 
function if/iz) are also suflicient to assure the fulfillment of the conditions 
627, one need merely regard a typical term in the partial fraction expan- 
sion of yl/{z). Such a term reads 


Since the residue ky is real and positive, and is a pure imaginary quan- 
tity, it is evident that the term 637 has the properties demanded by the 
conditions 627, and hence the finite sum of such terms which represents 
ypiz) has these properties. 

One has thus gained a new formulation for the necessary and sufficient 
conditions that P(z) be a Hurwitz polynomial. Namely, the quotient of 
its even and odd parts must be a function having simple poles on the 
imaginary axis only, and with positive real residues in these poles. 

It is collaterally useful to digress for a moment and study somewhat 
more carefully the properties of the function ^(s). First it should be 
observed that if satisfies the conditions 627, its reciprocal 1/rKz) 
does so also. Hence, using Eq. 629, the function 

1 _ n{z) 

rfiiz) m{z) 


[ 638 ] 
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must also have simple ix>les on the imaginary axis only and positive 
real residues. Both the even and odd parts m{z) and u{z) must, therefore, 
be polynomials whose zeros are simple and lie on the imaginary axis. 
Furthermore, since ix:>les of or 1 \^(c) which may lie at s = 0 or at 
c = X must also he simple, it follows that the highest powers of m(z) 
and n{z) as well as their lowest i)owers may not differ by more than 
unity. They obviously must differ at least by unity because and 
n{z) are respectively even and odd. 

If one writes 


^P{z) = 7/(.v,y) -f yi'(.v,y) 

[639] 

the conditions 627 are seen to yield 


— > 0 for.v = 0 
dx 

[640] 

The Cauchy-Riemann Kcj. 12 then shows that 


— > 0 for.v = 0 
c)v 

[641] 


But by Eqs. 627, w = 0 for .v = 0, so that this result may alternatively 
be written 


-^>0 for.v = 0 
j 


[642] 


which states that along the imaginary axis (where ^ has pure imaginary 
values only) ^ is a continuously increasing function. The interesting 
consequence of this fact is that the zeros and {X)les of ^(c), which lie 
along the imaginary axis, must alternate. 

If the polynomials m{z) and ;/(::) are written in their factored forms, 
the expression 629 for ^(c) reads 


Hz) = 


ajz^ - zr)(z- - Z:r) • • • (s- ~ zSn-\) 






'.r)(z- - Z.r) • • • (z- - z'.n^o) 


[643] 


in which it is arbitrarily assumed that the polynomial P(z) is of the 
even degree 2n. The alternation of zeros and poles along the imaginary 
axis is expressed by the conditions 

0 < [zi[ < [ 22 1 |22n— 2 I ^ |22n--l| ^ [644] 

This result is referred to as the separation property of the zeros and 
poles of \p{z). 

Using Eq. 220 for the evaluation of the residues of ^{z) in any one of 
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its poles at 2 = 22, 24, • * * 22n-2, one has 
k, = [(2 - 

^ - 2l^)(2r - 23^) ♦ » ‘ (Z,^ - Z^2n-l) 

(Zn-l2s.^(2,^-22^0 • • • (sr -2^-2) ( 2 ,.^ -2^4.2) * • * (z,^-Z%n^2) ^ ^ 


On the assumption that tin/un-i is positive, it may be seen from this 
result that the separation property expressed by the inequalities 644 
assures the positiveness of the residues of ^(2) in all its poles at finite 
frequencies. The residue of ^(2) at 2 = oc , incidentally, is recognized to 
be (In (Li-i, whereas the one at 2 = 0 has the value of 2^(2) for 2 = 0, 
which is positive since all the ejuantities are positive. 

In view of these further detailed results it may be stated that if ^(2) 
is a polynomial with positive coefficients, and if its even and odd parts 
m(z) and ;/(2) differ in their highest and in their lowest powers by no 
more than unity and have simple zeros restricted to the imaginary' axis 
where they mutually separate each other, P(z) is a Hurwitz polynomial. 

The purix)se in thus stating in a variety of forms, the necessary and 
sulTicient conditions that ^(2) be a Hurwitz polynomial is to focus atten- 
tion upon properties of these polynomials which frequently become 
collaterally iLseful. The actual process of making such a test may be 
based directly u|X)n any one of the sets of conditions already stated. 
.\ particularly effective procedure, however, is derived from these con- 
siderations in the following manner. 

For any given polynomial, the function ^(2) is readily formed accord- 
ing to Eq. 629, If P(z) is of an even degree, ^(2) has a pole at infinity; 
otherwise 1/^(2) has a pole at infinity. Whichever may be the case, one 
begins the procedure by considering that function or 1/^) which does 
have a pole at infinity. Without any loss to the ensuing argument, this 
function is assumed to be ^(2). More comi [etely represented, it has 
the form 


^(2) = 


(InZ^ + 2 ^ ^ 

(7„_l2”-‘ -f H h ^12 


[646] 


The first step in the procedure is to di\dde the denominator polynomial 
into the numerator polynomial by the common process of long dhdsion, 
however, ceasing after only a single term in the quotient is determined. 
This yields 


^(z) = 


a„ a'n—2Z” ^ 4“"~* ' ' ' ~f~ H~ d'o 

fln— 1 (In— 13 "~* + On— SS"””* + • • • + djZ 


[ 647 ] 


which may be indicated more compactly as 

,, V «n . m'{z) 
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If P(z), from which ^(c) is derived, is a Hurwitz {>()Iyn()miaI, ^(c) must 
have simple poles on the imaginary axis with jx^sitive residues. In partic- 
ular the pole at infinity, represented by the first tenn in Eq. 648, must 
ydeld a positive residue. Hence one has the first particular condition 




iJn-l 


> 0 


[649] 


If \l/{z) is imagined to be represented by its partial fraction expansion, 
one may identify the first term in Kq. 648 with that term in this expan- 
sion which represents the pole at infinity, 'i'hc remaining terms in this 
partial fraction expansion are then se(^i to rej^resent the corresponding 
expansion for the function given by the second tenn in E({. 648. 'Fins 
function is the remainder after the |X)le at infinity is removed fnmi yf/iz). 
It thus becomes clear that this remainder function must have the same 
properties as \p{z), namely, simple |x:>les restricted to the imaginary axis 
and positive real residues, and its reciprocal must likewise have these 
properties. 

Denoting the reciprocal of the remainder function by one has, 

according to Eqs. 647 and 648, 


riz) = -r- 




-f (JiZ 


+ n— I- 


4" 0 


[650] 


This function evidently again has a simple pole at infinity. Repeating the 
process of long division as before, one obtains 


^*{z) = -7 2 + 

n-2 ' 


n — r, 


+ • • • + (»'l2 


I , ^ — 4 1 

-f- <I ri— 4- “T 


-j- 4“ 


[ 651 ] 


or more compactly 


riz) 


a„_, _ , v'(z) 
a „_2 m (z) 


[652] 


The positiveness of the residue of ^*( 2 ) in its jx)Ic at infinity requires 
the second particular condition 


flti — 1 
^ n — 2 


> 0 


[653] 


The second term in Eq. 652 is a subsequent remainder function which 
again must have the same properties as rf'(z), and its reciprocal must 
also have these properties. The inverted remainder function 

^ [^54] 

again has a simple pole at infinity with a residue which must be positive. 
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The continuation of the process is thus clear, and leads successively to 
additional particular conditions like the ones expressed by the inequali- 
ties 649 and 653. The procedure terminates after all the terms in m{z) 
and n{z) are exhausted. 

Letting 





«3 



3 


[655] 


one obtains finally a representation for ^(s) of the form 

+ — 1 

aiZ i , 

^32 + 



anZ 


[656] 


which is referred to as a finite StieUjcs continued fraction.* It contains 
altogether n terms, as is clear from the following series of fractions indi- 
cating merely the degrees of the numerator and denominator polynomials 
appearing in the original function \p{z) and in the successively en- 
countered inverted remainder functions ^*(c). ^**(c), etc. 


// 


, 

>i - 1 


H — 1 11—2 

, 

11 — 2 >1—3 


2 1 

l’ 0 


[657] 


The necessary and sufTicient conditions that P(z) be a Hurwitz poly- 
nomial are now simply expressed by the statement that all the quantities 
ot 2 f as, etc., as given by Eqs. 655 must be positive. 

This procedure for testing a given polynomial may be replaced by a 
complementary one in which terms representing poles at 2 = 0 are 
removed from \f^(z) and the successive inverted remainder functions. 
One may say that this variation in the method amounts merely to re- 
placing z by 1/s and proceeding as discussed above. The test is thus 
applied to the polynomial P(l/s) instead of to P{z). Since the transforma- 

•This is one having the form of Eq. 656 in which all the coefficients are positive real numbers. 
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tion \/z maps the left half plane ujx>n itself, it is clear that is 
proved to be a Hurwitz polynomial if it can be shown that P{\/z) is such a 
one. 

To carry out tliis variation in the procedure one begins again with the 
expression 646 for but with the polynomials turned end-for-end. 
The first step in the process of di\dding the denominator into the numer- 
ator yields 




;,.T, 

JlC UiZ -f- • • * + 


[ 658 ] 


The first particular condition reads 


- > 0 [659] 

The inverted remainder must again have the same properties as \l/(z) 
because the second term in Eq. 658 is equal to the partial fraction ex- 
pansion of yp(z) minus the term for the ix)le at c = 0. 4'his second term, 
therefore, has the stated properties and so does its reciprocal. The 
inverted remainder, moreover, again has a simple ix)le at c = 0, and the 
requirement that the residue in this pole be positive yields the second 
particular condition 

-,-j- > 0 [660] 

G 2 


and so forth. 

One thus arrives at the dtemate finite Stieltjes continued fraction 

^(z) = ^12-' + -E 

P2^ I* 


in which 


+ 



[ 661 ] 


01 = 


Gq 

ai 


P2 — // 

G 2 

P3 ~ // 

G 3 


[ 662 ] 
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'Fhe necessary and sufficient conditions that P{z) be a Hurwitz poly- 
nomial are that all the coefficients • • • /3n be positive. These conditions 
are entirely equivalent to the ones involving the coefficients ai • • • an 
given by Eqs. 655, although they are different in their detailed appearance. 
It should be clear, however, that the quantities /5i • • • /3n as functions of 
the coefficients Ui, • • • of the given polynomial P(z) must be the 
same as the expressions for ai ••• an in terms of these coefficients except 
that the consecutive order of the subscripts 0, 1, 2, • • • w on the a’s is 
inverted. In other words, if in the expressions for ai • • • an one makes the 
substitutions indicated by 


Qn-I 


do — > Un 


[663] 


the corresponding ones for /Sj • • • /5n are obtained. 

In order to determine the expressions for the a’s or (5's in terms of the 
coefficients Uo •••(!;, of the polynomial P(z) from the results expressed 
by Eqs. 655 and 662, it is necessary to carry through the indicated 
processes of long division involved in the steps leading to the continued 
traction expansions 656 and 661, so as to obtain the coefficients 
a'o • • • a'n_ 2 , a ^'2 * * * etc., as functions of Jq * • * ^ this is done 

for the 15'Sj one finds that the results obtained for the first three steps 
(that is, for the determination of (^27 0 :i) yield conditions which may be 
expressed in the form 

Di — ai > 0 


Z?2 = 


do 

U 3 U 2 


> 0 


Di 


dl 

do 

0 


do 

di 

ds 

d4 

d3 


[664] 


with the condition tio > 0 tacitly understood. From the structure of these 
determinants and the recurrent nature of the process of deriv^ation, one 
may assume that the determinant of «th order has the form 




ai ao 0 • • • 0 

Cq (I 2 CLi do 0 • • • 0 

^6 ^4 ^3 ^2 ^1 ^ 0 • • • 0 


[ 665 ] 


^2n — 1 d2n~-2 


a 
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The correctness of this generalization may be verified through showing 
that it is consistent with the process of deriving an degree Ilurwitz 
poljTiomial from one of the degree n — Suppx)se the latter is written 
in the form 

Pn-i = 4- -f • • • 4- 4 iin [666] 


in which the order of the coefficients is reversed, so that the expressions 
for /3 in the former polynomial become identical with those for a in the 
present one. The corresponding function \l/{z) reads 






4- • • • d~ Cln —2^‘' J\ 


[667] 


Since this function has a zero at infinity, the one associated with a 
Hurwitz |X)lynomial of the ;/th degree may be formed by adding to the 
expression 667 a term representing a simple pxde at infinity with a posi- 
tive residue. This function may, therefore, be written 


= \pn-\ 4 

with the condition that Gq > 0. Using Eq. 667, one llnds 

^n = 

\-Gr>-iz 

and the associated polynomial reads 

Pn = GoGiZ^ 4 4 {(loG-.i 4 <72)-"““ 4" 

+ (^7o^75 + < 74 )-""^ 4 • • ■ 4 Gn~lZ 4 <7„ 


[66S] 

) -“4<7, 

^* 669 ] 


f)7()] 


From the manner of its derivation, P„ is a Hurwitz polynomial if 
Pn~i is known to be such a one, and if Uo is ix)sitive. The critiTia which 
assure that P„_i be a Hurwitz pxilynomial may, according to Eqs. 665 
and 666, be expressed by writing 


G2 

<7l 

0 

0 


<7.3 

<72 

<7l 


<75 

<74 

<7,3 

<78 

<77 

<76 

<75 


> 0 


1 671] 


with the understanding that ai > 0 and that one is to consider all 
principal minors of this determinant which are formed by the first, the 
first two, the first three, etc., rows and columns. 
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According to determinant theory it is readily recognized that 



0 

0 

0 

0 


ii'i 

il\ 

0 

0 


ii\ 

(l^ 

<l2 

ii\ 

ill 


as 

^4 

a.3 

<^9 


a^ 

ae 

as 


[672] 


which is formed from the determinant 671 by the addition of a first row 
and column as indicated. Since Ui >0, the conditions formed from this 
modified determinant arc identical with those formed from 671 except 
that one must begin with the consideration of two rows and columns in 
order to obtain successively the same conditions as before. 

The first, third, fifth, etc., columns in the determinant 672 are now 
multiplied by the positive constant Uo, and the determinant is then 
modilied in form (although not in value) by adding to the elements of the 
second column the corresponding ones of the first column, and to the 
elements of the fourth column the corres|X)nding ones of the third column, 
and so forth. This yields 


(iQdi 

(/()(/ 1 

0 

0 

aoaa 

((/oUa -f ii>) 

Uodi 

Uoai 

aoao 

(iioiis + < 14 ) 

aoaa 

(t20(l3 + fll’) 

lIqUi 

(ilodj + (Je) 

iloilo 

(<*0<*5 + Oi) 


i ' * ’ * ; 

If the constant Qq is now imagined to be factored out of the first, third, 
etc., columns, one obseiv'es that the result is a determinant Dn from which 
the criteria for the polynomial 670 are formed in the same manner that 
the criteria for the {X)lynomial 666 are formed from the determinant 671. 
Thus, by the method of induction, the correctness of the relations 664 
and 665 for the desired criteria (these are the Hurwitz criteria) is es- 
tablished. 

Routh, who first derived these criteria* (although he stated them in a 
somewhat modified form) made use of a theorem of Cauchy’s, which, for 
the sake of its collateral interest, is briefly discussed in the following 
paragraphs. 

A finite polynomial is wTitten in the form 

P{z) = uix,y) +jv{x,y) 

* Adams Prize Essay, 1877; Rigid Dynamics, paragrdph 290. 


[ 674 ] 
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and the behavior ot the quotient 

9(^.y) = [675] 

is observed as the point z = x + jy traverses once around a simple closed 
contour C which does not pass through a zero of P{z). As is shown in 
detail presently, the function ^(.v,y) is seen to change its algebraic sign 
either by passing continuously through zero or by passing through the 
value X . Considering only the continuous passages through zero, one 
observ’es the number of changes of sign from plus to minus and the 
number of changes from minus to plus. Denoting these numbers by r and 
5 respectively, the theorem of Cauchy states that J — s) is the number 
of zeros of P(z) enclosed by C. 

To prove this theorem it is expedient to write the polynomial in the 
factored form 

P(z) = ^ (s ~ - s,) . . . (s - s,) [676] 


in which ^ is a real constant and is assumed to be positive. Each factor 
may be represented in the polar form 



(z - 3.) ^ 


[677] 

whence 

II 

.r . •••+»-.) 

' n * 

[678] 

Letting 

A r\ ro ' ' 

• r„ = R 

[679] 

and 

-h ^2 + • • 

II 

+ 

[680] 

one has 

P{z) = = R 

cos B + jR sin 0 

[681] 

Reference to Eq 

s. 674 and 675 then shows that 



q{x,y) = 

= cots 

[682] 


It remains to see how the angle 6 behaves as the variable z = x + jy 
traverses once around a simple closed contour. For a single factor, as 
given by Eq. 677, this behavior is illustrated in Fig. 50. In part (a) of this 
figure the point z = z, is assumed to lie outside the closed contour C, 
whereas in part (b) of the figure the point z = Zp lies within the contour. 
It is immediately apparent that the nel change in as z traverses the con- 
tour is either zero or 2ir according to whether lies outside or inside the 
contour. The net change in the resultant angle 6, as given by Eq. 680, is 
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thus seen to be 27riV, with N equal to the number of zeros of P{z) k. 
by the contour C. Reference to the expression 682 for the quotient r^oAi 
u/v now reveals the truth of the theorem without further difficulty. 

It may be of interest to point out that the behavior of the angle 6 can 
be determined alternatively from the theorem given in Art. 19 involving 

I 2-plane 
C 


ol JT 

(a) (b) 

Fig. so. The ;z^7 change of 0^ is 0 for (a) but is Itt for (^) 

the number of zeros and jxiles of a function within a given region. Let the 
function be f{z)j and write it in the polar form 

[683] 

Then 

0 = Im (In /) [684] 

in which /;;/ denotes “ imaginary part of.” Xow 

de = Im [685] 

and hence Eq. 282 yields 

£,de ^ AO = rm£;^dz = 27r(.V - P) [686] 

Thus the net change in angle Sd is found to be equal to 27r times the dif- 
ference between the number of zeros and the number of poles of /(s) 

enclosed by the contour C. The polynomial P(s), given by Eqs. 674 or 
676, can have no poles in the finite s-plane, and hence the present result 
agrees with the conclusion reached above. 

27 . PoSITIVI^ REAL FUNCTIONS 

A rational function w = f(z) which is real for real values of s, and whose 
real part is positive for all values of z with a positive real part, is called a 
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and the function* (abbreviated p.r. function). These conditions are 
n 


/(c) real for s real 
^€[}{z)] ^ 0 for Re{z) > 0 


[687] 


Functions of this sort play an important part in electrical network theory, 
and it is therefore of interest to study their pro|)erties in some detail. 

Since the function very often is an im|)edance. it is customarily denoted 
by the letter Z. The indefxuident variable is the complex frequency, 
which is written X = <r + /co. Bein^ a rational function, Z(X) is >^iven 
by the quotient of two polynomials, and since Z{X) is real for real values 
of X, these polynomials must have real coellicients. One may write 


7 (\\ - 

(?(X) U2(<rjOi) +yt2(<r,a)) 


[ 688 ] 


in which Ui and /Zo are the real parts of the ix)lynomials P and Q resjx'c- 
tively, and t’l, Vo are the corresix^nding imaginary parts. Rationalizing, 
one finds 


R€[Z(\)] = 


U\U2 

•) 

>*" 





[689] 


a.Tv^ svnYv\c\^y 



[690; 


Since the denominators, in 
tive, it is evident that \i Z 
tive real. 


the Li^t t\ut t \[.n -Niuns must always lyc jx>si- 
X) n a). its rvrijimcaJ ^ aAw posL 


The poles of Z^X) are the /.vn,. . 
one may represent Z, Ay by the La 


-i ‘ > \ 

urvia 


I/, f};.- 


\ ■' inity of one of these, 


X, . . ;r>oij 


[692! 
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and 



[694] 

one obtains for this vicinity 


k 

Fe[Z(X)] ^ — cos (50 — 0) 
p 

[695] 


As the immediate neighborhood of the pole is explored through allowing 
0 to vary from zero to lie, one obser\x‘s that the real part of Z(X ; changes 
sign Is times. It is clear, therefore, that Z(X) can have no poles in the 
right half of the X-plane. It follows furthermore that poles may lie on 
the imaginary axis but that such ix)les must be simple (s — \) and the 
function Z(X) must there have {x^sitive real residues (/!? = 0) so that, 
for the immediate vicinity of such a pole, Eq. 695 becomes 

k 

/^c[Z(X)] ^ - cos 0 [696] 

P 


which remains positive for — 7r/2 < 0 < 7r/2, that is, for values of X in 
the right half plane. 

Since the same conclusions apply also to the reciprocal function 1 Z(X), 
one recognizes that the zeros of the ix)lynomials FCX) and Q(X) must 
haw real which are not jxysitivc. That i.s, P(A) c/z/cZ Q(\) zzznsI be 

Iluruitz polynofttidls.* 

.M<jre e xplicitly, the expression for Z{\) may be written 

, P(X) do 4- diX d- -f • • • 4- tlnX"* rA07l 


Z(X) = 


/fiX 4 /^jX” -f- ' ' ’ "4 t^^/^X' 


If the degree // of P(X) /s higher than the degree m of (>(X), then Z(X) 
has a \yo\e at X ^ ^ . Since this \x>int may be regarded as lying on the 
imaginary' axis, such a ix)k\ it present, must he simple. Similarly, if 
^ ivmcti " ^ '' ' ' " \ = 0 it = 0; it has 

a [K)/e of sea)n(l order at / ’ Since the 

'\>o be simple. 
Mcal of Z(\K 


X = t) Vu-s t>n f •' imat 
Recoi^ni^ins^ that fh<' - o??#* r/ 

*UK- Mfs that tin- [»<> ii-. 

re>trit lion that //:c Uk. 

I* ' \ ) (Did ih \ ) ( dr''( 

In examining a gi\i- 
whether or not it is p 
F(X) and are Hu it/ |m>1 

as their highest |)<>wers e.lTcr l)y n* 

*'nie term Ilun^itz polytiomicJ .v u ol i 
.A-rtis on tile imaginary a.xi'S. 
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sion 689 for the real part of Z{\) and examine its behavior over the 
entire right half of the X-plane, however, is a laborious procedure which 
one would like to avoid. In this regard it is found helpful to make use 
of the theorem (discussed in Art. 21) which stales that if a function is 
analytic within and on the boundary of a given region, the maximum 
and minimum values which the real and imaginary parts of that function 
assume on the boundary are maxima and minima for the enclosed region. 
This region, in the j^resent problem, is taken to be the right half of the 
X-plane; its boundary is the imaginary axis. 

If Z(X) is a tx)silive real function, it is analytic in the right half plane, 
and if, for the moment, one assumes that Z(X) has no i:)oles on the imagi- 
nar>’ axis, the theorem just cited assures that the smallest value which 
the real pi^rt of Z(X) assumes on the imaginary axis must be smaller 
than any value which this real part may have over the entire right half 
plane. Conversely, if Z(X) is analytic in the right halt jdane and on the 
imaginary axis, and if the real part of Z(X) on the imaginary axis is 
nowhere negative, this real part must remain |x^sitive over the entire 
right half plane, and Z(X) must be a positive real function. 

The stipulation that Z(X) be analytic on the imaginary axis may be 
dispensed with, for if Z{X) has poles on the imaginary axis, it is merely 
necessary to modify this boundary by inserting vanishingly small semi- 
circular detours at such j')oles, so that the resulting boundary avoids 
these points by i)assing slightly to the right of them. As shown above, 
the requirement that the real part of Z{\} shall remain i)ositive on a 
small semicircular detour is taken care of by the stipulation that jx^les 
on the imaginary’ axis be simple and that the residues of Z(X) at such 
poles may be real and ix)sitivc. 

The necessary" and suflicient conditions that a rational function Z(X), 
which is real for real values of X, be positive real may thus be stated in 
a form which does not require an investigation of the real part of Z(X) 
over the entire right half plane. Such a statement reads: 

//Z(X) is analytic in the right half plane, and 

if on the imaginary axis this function has only 

simple poles uith positive real residues, ZfX) [698] 

is a positive real fufwtion if Re[Z(ja/)] ^ 0 

for all real values of w. 

In order to form the real part of Z(X) for X = ju, one may begin with 
the expression 697 and in each of the two tx)lynomials groUp the terms 
with even and odd powers res[x;ctively. That is, the polynomials are 
written 


P(X) = JWi(X) + »i(X) 
2(X) = nizOC) + n2{\) 


[ 699 ] 
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in wliich Wi and mo are the terms involving even powers of X, whereas 
n\ and no are the terms involving odd powers of X. It is clear that for 
X = 7 ( 0 , and mo are real, whereas n\ and no are imaginary. A process 
of rationalization applied to Z(X), therefore, yields 


Z(X) = 


(wi + til) (mo 
{nio d- no)(mo 


Ho) 

no) 


[700] 


from which it is clear that* 


\ W72 - «2“ /\=j. 


[701] 


d'he denominator in this expression represents the square of an abso- 
lute value and hence is surely i)ositive. The condition Re[Z(jcA))] ^ 0 is 
thus seen to be expressec^ by 

(niinio — nifio) ^ 0, for X = [702] 


If i'^^XJ and (2(Xj arc assumed to have the same degree n (this assump- 
tion does not restrict the generality of the present argument), it is clear 
that (tninio — niHo) is a polynomial of the degree n in the variable X“. 
One may, therefore, write 

{minio — wp/j) .1 (\i‘ — X“)(X_»* — X“) • • • (X„" — X“) [703] 


or, for X = jojy 

{mivio — niHo) = -d(Xi' + w‘)(X2“ + w") • • • (Xn“ + co“) [704] 

I'hc constant .1 must evidently be positive if this expression is to be 
positive for all values of since it must still be positive for w — ► x . The 
X“-robts, which are denoted by Xi“, X 2 “, • • • X,,^, may be complex as 
well as real, but since the {X)lynomial 703 ha- real coetlicients, any 
complex roots, if present, must occur in conjugrite pairs. Such a pair of 
roots leads to a pair of conjugate complex factors in the expression 704, 
and hence yields a resultant factor which is the square of an absolute 
value. C'omplex as well as {X)sitive real X“-roots, therefore, yield 
factors in the expression 704 which are surely positive for all real 
values of w. This statement is still true if some of the real X"-roots are 
zero. A negative real X“-root of even multiplicity leads to a factor in 
the expression 704 which is raised to an even power, and hence such a 
factor is also surely positive. However, if there exists a negative real X“- 
root of odd multiplicity (for example, a simple root of this sort), the ex- 
pression 704 is surely negative over some part of the range 0 < (*?“ < oo. 

*The functions w(\) and n{\) should not be confused with M(<r,w) and appearing 
in Eqs. 688, 689, and 690. m and n become identical with u and jv respectively for X = 
that is, only for (t = 0. 
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It thus becomes clear that the necessary and sufficient condition insuring 
Re[Z(ja))] ^ 0 is simply that, in addition to being positive for = — oo, 
the polynomial (mim 2 — niUo) shall have no negative real \^-roots of odd 
multiplicity. The fulfillment of this condition together with an assurance 
that Q{\) is a Hunvitz ix)lynomial sufiices to prove that Z(X) is a positive 
real function provided (?(X) has no zeros on the imaginary axis. If it has, 
one must also establish that these zeros are simple and that the residues 
of Z(X) there are real and positive. If Z(X) is found to be a positive real 
function, one may be sure that P{\) is, of course, also a Hurwitz ix)ly- 
nomial. 

If Q{\) has zeros on the imaginary axis and P{\) does not, it is easier to 
establish the positive real character of the reciprocal function 1 Z(X) 
because the latter then has no jx^les on the imaginary axis although 
Z(X) does. When 1 Z(X) is found to be jxisitive real, it follows without 
further proof that Z(X) is |X)sitive real also. 

A limiting form of the jx^sitive real function Z(X) results if its real 
part is identically zero for X = According to Kq. 701 this situation 
requires the condition 

{niinio — niUy) ^ 0 [705] 

in which X need not be restricted to pure imaginary values.* If, for the 
moment, none of the polynomials ;wi, W 2 » are considered to be zero, 
one may write the condition 705 in the form 


mim2 



[706] 


a representation for the polynorm'al Wi that is possible only if wqwo con- 
tains «2 as a factor. Since /?/> + ^2 is a Hurwitz jx)lynomial, the parts 
W 2 and «2 have no common factors (the discussion of the preceding 
article shows that the zeros of nio and Un separate each other on the 
imaginary axis). Hence the identity 706 can be fulfilled only if 

mi = and /q = W 2 X'’ [707] 


the factor X^ arising from the obser\^ation that one may multiply nu- 
merator and denominator on the right-hand side of the identity 706 by 
any power of X. The functions m and n being respectively even and odcl, 
it is clear that the integer p is odd. 

The results 707 now yield for the impedance 


Z(X) = 


(m 2 + n2)yd* 

m 2 + «2 


[708] 


*It should not be inferred from Eq. 701 that, if W 1 W 2 — W 1 W 2 vanishes for all values of X, 
so does the real part of Z(X), for the expressioO 701 represents the real part of Z(X) only for 
X = juif not for X-values in the complex plane. 
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in which one can insert a constant multiplier if desired. Since a pole of 
Z{\) at infinity must be simple, one has /> = 1. The function Z(X) thus 
obtained is trivially simple. 

The only other conclusions permitted by the identity 705, if Z(X) is 
not to become identically zero or identically infinite, read 


nil = 0 and «2 = 0 

[709] 

;zi = 0 and = 0 

[710] 


Correspondingly the impedance becomes 


Z(X) = 


;/i(X) 

W2(Xj 


or 


Z(X) = 


mx(\) 

ni{\) 


[711] 

[712] 


According to the discussion of Hurwitz polynomials given in the pre- 
ceding article it is seen that these functions Z(X) have simple zeros and 
poles restricted to the imaginary axis. Thus the special form of a positive 
real Z(X)-function whose real jiart is identically zero for X = jo) is the 
same as that sjiecial function whose poles are restricted to the imaginary 
axis. One may say that if Z(X) is a positive real function whose real part 
is identically zero for X = the zeros of both P{\) and ()(X) must lie on 
the imaginary axis. 

Since Z(X) is by definition a positive real function, the residues in all 
its jMiles are real and positive. A partial fraction expansion of such a 
Z(X)r function reads 




kn k± 

+ r7V + ^_:r + 5- -+---+*2pX [713] 


The terms in this expression, except for the first and last, are pairs of 
conjugates. Since the poles are restricted to the imaginary axis, the conju- 
gate of X^ is —X*,; and since the residues are real, conjugate poles (which 
normally involve conjugate residues because Z(X) must be real for real 
X s) yield identical residues. The first and last terms in Eq. 713 represent 
{Kissible poles at X = 0 and X = « . 

Combining conjugate complex terms in the expression 713 gives 

■Z(X) = Y + - Xo- X 42 ^ [714] 

Since 

X, = dry'w. and X,^ = —u>,^ [715] 
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one finds 


. N ^0 , 2k2jo2 2A’4 /w 

Z{]0i) — “ H H -f 

J<ji 402 “ ~ w” 0 ) 4 “ — a)“ 


+ k2pj(t} [716] 


This special form of the function Zi jo)) is of practical interest because it 
represents the impedance of a lossless network. I'he function Z(X) in this 
case is evidently on the borderline with resj)ect to the property of being 
positive real. 

In terms of the factored forms of the pcfiynomials P{\) and (^(X), this 
function Z[joj) has a representation of the fonn 


zu^) 


yco(aj“2 


— cj") • • • (a,'*2p— 1 w”) 


[717] 


in which II is a positive real constant. 

Even in this special limiting case the real part of Z(X) is zero only on 
the imaginary axis of the X-plane. For complex X-values one has 

Z(X) = RiCyO}) + jX(a,u) [718] 


in which R and A' are the real anri imaginary parts. Acconling to the 
Cauchy-Riemann Eqs. 12 and 13, 


dR _ 

dX 

dR 

dX 

da 

do3 

0o3 

da 


[719] 


Since R(cr,oj) is identically zero for <7 = 0 and }X)sitive for (t > 0, it 
follows that dR d(T is jx)sitive and hence that dX du) is iK)siti\T for 
(7 = 0. In other words, when all the ix>les of Z(X) lie on the imaginary 
axis and its real part there is identically zero, one has 



dZ{ Ju) 
j dcj 


> 0 


for 


OC < CJ < « 


[720] 


A similar relation then also holds for the reciprocal function. 

This result states that the real function Zijuj) j of the real variable 
03 has a positive slope for all values of 03. As a consequence it follows that 
the zeros and poles of Z(X), which are all simple and lie along the imagi- 
nary axis, must mutually separate each other. This alternation of zeros 
and poles may (with reference to the notation in Eq. 717) be expressed by 

0 < a>i < W2 < • * * < W2p-2 < W2p — 1 < ^ [721] 

It is significant to observe that a positive real function in this limiting 
case has properties identical with those of the function ^( 2 ) discussed in 
the previous article (see Eq. 625 and following). A function of this type 
is represented by the reactance or susceptance of a lossless electrical 
network. One may, therefore, state that the polynomials whose ratio 
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represents the reactance or the susccptance of a lossless network are the 
even and odd parts of a Hurwitz fX)lynomial. The converse of this state- 
ment is likewise true. When the expression 688 represents such a re- 
actance or susccptance function, one of the polynomials P(X) or ^(X) is an 
even and the other an odd function of X, as shown by Eqs. 711 and 712. 

Returning to the general case, one may discover additional useful 
properties of the positive real function by considering its linear fractional 
transformation 


1 - Z(X) ^ Q{\) - P{\) 
1 -f Z(X) ^(X) + P(X) 


[722] 


As shown in Art. 24, this transformation maps the right half of the 
Z-plane uix)n the interior of the unit circle in the r:-plane (and vice 
versa), the imaginary axis of the Z-plane becoming identified with the 
unit circle in the ::-planc. If Z(X) is a positive real function, points in the 
right half of the X-plane correspond to points in the right half of the 
Z-plane and hence to ix)ints within the unit circle of the >plane. One 
may state 

//Z(X) is positive real, [z(X)| ^ 1 for Re(X) ^ 0 [723] 

It is readily appreciated that the converse must be true also; that is, 
If |Z(X)j ^ 1 for Re(X) ^ 0, then Z(X) is positive real [724] 

^loreover, if Z(X) is positive real, then ::(X) must be analytic in the right 
half plane and on the imaginary axis, for 1 -f Z(X) cannot be zero there. 
According to the principle of the maximum modulus (discussed in Art. 
21), the largest value which |-(X)| assumes on the imaginary axis of the 
X-plane must be a maximum for the entire right half of that plane. 

Conversely, one may state that if s(X) in Eq. "22 can be shown to be 
analytic in the right half plane inclusive of the imaginaiy’ axis, and if on 
this boundary |-(X)| ^ 1, this inequality must hold for the entire right 
half plane and, according to the statement 724, Z(X) must be positive real. 
It is now interesting to consider the function 

Z-W - ^ [725] 

in which 

P*(X) = Wi(X) + 

^(X) = »i2(X) + Wi(X) 

Comparing these relations with Eqs. 699, one observes that the poly- 
nomials P*(X) and formed from P(X) and Q(\) by interchange 

of their odd parts. In terms of Z*(X) one now forms the expression 
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analogous to 722, namely, 

^ 1 + Z*(X) ()*(X) + P*{\) 

According to Eqs. 699 and 726 it is readily seen that 
Q*{\) + P*{\) = Q(\) + Pi\) 


[727] 

[728] 


Since it has already been shown that the {xdynoniial P{\) + (^(X) has 
no zeros in the right half plane or on the imaginary axis when Z{\) is 
{X)sitive real, one obser\’es in that event that c*{X) is analytic in the entire 
right half plane inclusive of the imaginary axis. 

From the mapping properties of the function 727 it follows that !c*(yw)| 
^ 1 for Re[Z*(ji>3)] ^ 0. The latter ineciuality is found, from Kqs. 725 
and 726, to be expressed by 


. mo" - nr A 


[729] 


or, since {ni 2 “ — Wi^) for X = ju) is the square of an absolute value, one 
has more simply 

(miPio — nifio) ^ 0, for X = joi [730] 


This is the condition 702, which is fulfilled if Z(X) is positive real. In 
this event the present argument is thus seen to yield \z*{ju3)' g 1, and 
since 2 *(X) is analytic in the right half plane inclusive of the imaginary 
axis, the principle of the maximum modulus enables one to conclude that 
| 2 *(X)| ^ 1 throughout the entire right half plane. According to the 
statement 724, therefore, the function Z*(\) is then seen to be jxisitive 
real. This conclusion is summarized by the statement: 


If the rational function Z{X), given by the 
quotient of polynomials P(X) and Q(X), is 
positive realj the rational function u^hich 
results after the even or odd parts of these 
polynomials are interchanged is also posi- 
tive real. 


Since Z*(X) is positive real, it follows that P*(X) and ^(X) are Hurwitz 
polynomials. Hence one may state 

If the quotient of polynomials P(X)/Q(X) 
is a positive reed function, not only these 
polynomials hut also those which result [732] 

from an interchange of their even or odd 
parts are Hurwitz polynomials. 
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Further practically useful results are obtained from an investigation 
of the properties of positive real functions in polar form. The first step 
in this direction is the introduction of the relations 


3(X) 


\ - A 
X -f- A 


or 


X = 


A • 


1 -h s 
1 — 3 


[733] 


" Z(X)“ + B 


[734] 


The positive real constants A and B have any values satisfying the 
relation 

Z{A) = B [735] 

By means of the transformation 733 the interior of the unit circle in 
the 3-plane is ma{)jK‘(l u[X)n the right half of the X-plane. The positive 
real function Z(X) relates ix)ints in the right half X-plane to points in the 
right half Z-plane, and these in turn are mapped upon the interior of the 
unit circle in the tc'-plane by means of the transformation 734. As a 
consequence of the relati(ni 735 between -1 and B^ the origin in the s-plane 
corrcsjx)nds to the origin in the u'-plane, that is, 

ic(0) = 0 [736] 

The function Z and its independent variable X are now represented in 
the polar forms 

X = and Z = [737] 

In, terms of the variables thus introduced, Fig. 51 illustrates the identical 
transformations 733 and 734. Part (a) shows how the concentric circles 
(p = constant) and the radial lines (0 = co:.istant) of the right half 
X-plane appear inside the unit circle of the 3-plane, and part (b) similarly 
illustrates the api>earance of the polar representation of Z within the unit 
circle of the tc-i)lane. In each case the unit circle itself represents the 
imaginary axis of the X- or Z-plane, the left-hand point on the circle 
(- = -u' = — 1 ) corresponding to the origin in either of these planes and 
the right-hand one (3 = Zc’ = 1) to the point at infinity. 

The imaginary axis of the X-plane thus corresponds to |3| = 1. There- 
fore, if Z(X) is a positive real function, points on the locus |s| = 1 yield 
Z-values which are in the right half or on the imaginary axis of the 
Z-plane and hence \\T[thin or on the unit circle of the Z£^-plane. That is, 
one may state that 

|z£;| ^ 1 for \z\ = 1 [738] 

In view of this result and the condition 2£;(0) = 0, as expressed by 
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Eq. 736, it is recognized that Schwarz's lemma (see Art. 21) enables 
one to make a considerably stronger statement, namely, that 

|a'i ^ H for ';r:| < 1 [739] 

in which the equals sign holds only if it holds identically. 

This result is readily translated into an expression involving X and 
Z(X) since it states, with reference to Fig. vSl, that to any concentric 
circle within the unit circle of the ::-plane there corresponds a concentric 


e-plane u;-planc 



Fig. 51. Relevant to the derivation of the properties of a p. r. function expressed 

in polar form. 

circle within the unit circle of the Ti’-plane which is at least as small or 
smaller. If one considers the value p = -4, which implies no restriction 
since the value of A is arbitrary, one obser\'es by inspection of the figures 
that the following condition obtains: 

|0| ^ \4>\ for |<^| g 2 L740] 

This result may be written in the alternate form 

iarg. Z| ^ [arg. X| forO < |arg. X| g ^ [741] 

Again the equals sign in the first of these inequalities holds only if it 
holds identically. 

The remarkable part about this result is that although it appears to be 
stronger, it nevertheless is contained in the statement Re[Z{\)] ^ 0 for 
Re{\) ^ 0, expressing the positive real character of Z(X). Since the 
present result is readily seen to include this statement in terms of the 
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real parts of X and Z(X), one concludes that the two forms of expressing 
the fxisitive real character of Z(X) are entirely equivalent. The statement 
in terms of the angles of X and Z(X), just derived, is a translation of the 
one in terms of real parts into its equivalent polar form. 

An additional inequality may be obtained from these considerations. 
With reference to the transformation 7v33 and the polar representation 
737 for X, let p = A. d'his choice is always possible since the value of A is 
arbitrary. Equation 733 then yields 

- 1 


+ 1 


• 4- ^ 

./tan- 


The condition 739 may therefore be written 

|:c| ^ tan ^ for 0 < </) < ^ 

Using the inverse of the transformation 734, which reads 

^ 1 -f“ 

B ^ 1 


and taking note of Eq. 735 ,one obtains the condition 
</> 


1 — tan 


<t> 

l+tan- 


1 -f tan 


<t> ^(p) 


1 — tan 


<t> 


for 0 < 0 < “ 


[742] 


[743] 


[744] 


[745] 


For <t> = 7r/2 the lower and upper limits in this relation are zero and 
infinity, but for 0 = 7r/4, for e.xample, one has 

0.414 g 2.411 [746] 

lkp) 

The restriction implied by the condition 745 is thus seen to illustrate 
another interesting prop)erty of positive real functions. 

A property which is rather obvious, but nevertheless practically useful, 
is expressed by the statement: 


If Z(X) atid W(X) are positive real func- 
tions, f(X) = Z(W) is again positive real. 


[747] 


In other words, a positive real function of a positive real function is 
also positive real. As an example one may consider the simple positive 
real function 

H'(X) = ] 


[748] 
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The above statement asserts that if Z{\) is positive real, Z(l/X) is also 
positive real. Again, one may have 

n-(X) = X + J [749] 

which is readily recognized as being |X)sitive real. Then if Z(X) is positive 
real, one may state at once that Z(X 4- [1 X]) is likewise jx)sitive real. 
Use of the condition 741 enables one to make the further assertion that 
the angle of the function Z(X + [1 X]) must, for every value of X in 
the right half plane, be smaller than the angle of Z(X). 


PROBLEMS 

1. Verify that the real and imaginary pari^ uf the function 


cos y + jc^ sin y 

satisfy the Cauchy-Riemann equations. Show that this function must be r where 
s = X -f iy. 

2. Using this reasoning, show that 

sin :i* cosh y -h / cos x >inh y - >in c 
cos .V cosh y -f j sin x sinh y = cos c 

3. Let the sphere in stereographic projection be of radius 1, so that its equation is 

e + = 0 


Show that the point f, 77, f on the sphere corresix)nds to the point 


X 


2 -f 




277 

2 -f 


in the plane f = 0. Show that if 22-1 = —4, z\ and 22 correspond to diametrically 
opposite points. 

4. For It; = sin 2 = w sketch the curv’es ii = constant and v = constant in 
the z-plane and verify that they are orthogonal. 

5. Actually integrate dz around the following contours, and verify that the results 
are zero: (a) Around the square having vertices at 1 -f 7, 1 - 7, -1 — 7, - 1 -h 7. 
(b) Around the triangle having vertices at 1 + 7, — 1 — 7, - 1 + j. 

6. Carry out the integration for (I/2) dz about the following contours: fa) The 
square widi vertices at 1, 2, 2 4-7, 1 4*7. (b) A circle of radius 1 about the origin as 
center. Why is the result not zero in part (b) ? 

7. As an example of analytic continuation, consider the following: (a) Find the 

function represented by 1 4- 2 4* 2^ * * * . (b) Determine its circle of convergence, and 
find the singularity on the circumference, (c) Determine the series for this function 
about the point z = (d) Verify that this series converges for points for which the 

original series diverges, in particular at the point z = - L 

8. As an example of a function having a natural boundary, consider 

/(z) = 1 4- 2^ 4- 2^ 4- 2** 4- 2*® H 
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(a) Show that the radius of convergence is 1. (b) Verify that for every point of the 
form k and p being integers, all terms after 2 ^^ will be 1, so that the function is 

singular at such points, (c) Show that these points are dense on the unit circle, in the 
sense that there is no interval free of them. As a consequence the function cannot be 
continued outside. 

9. In deriving the Laurent series for a function, the integral formula for the coef- 
ficients is not practical. The function 

sin 2 

has a pole of order /r — 1 at the origin, and its Laurent series would be found by 
dividing the series for sin z by 2 *. Using this idea, find the Laurent series for 

cos 7r2 

(1 

about the point 2 = 1 . 

10. In deriving a Laurent series, the method of partial fractions is useful. For 
example find the Laurent series for 

J 

2“(l — 2 ) 

about 2=0 and 2 = 1. What are the regions of convergence? 

11. Sketch the lines u = constant and v = constant near the origin for the functions 

it' = 1 -f- 2 ‘‘ and w - \ 
which have saddle points there. 

12. Find the value of the integral of esc zdz taken around the following contours: 
(a') .\ circle of radius I and center at the origin, (b) A circle of radius 4 and center at 
the origin, (c) A circle of radius 2 and center at 2 = 2. 

13. Find the integral of sin 2 2 * for = 1,2, 3, and 4 taken about a circle of radius 
1 and center at the origin. 

14. Determine the polar and rectangular forms of the function 

^ ^ sinh ;/2 
"" ;z sinh 2 


in which n is an integer, and show that the Cauchy-Riemann equations are satisfied. 

15. Show that f{z) in the foregoing problem is an entire function, and determine 
the distribution of its zeros. 

16. Suppose the derivative of /(z) = w(x*,y) -\-jv{x,y) is written 

-f a +jb = v'<j2 
dz 


If f{z) satisfies the Cauchy-Riemann condition equations, show that one may have 


a 


du 

dx 


and 



or 


dv 

a = — and 
dy 


b 


du 

dy 


For the following functions 

2 "; coshz; sinh 2 ; 


In 2 ; 


cos '2 
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compute the polar form of df/dz through hrst forming u and :• and obtaining a and b 
by one or the other of the above pairs of relations, and alternately through forming 
df dz directly and then putting the result into its polar form. Check the three results 
for each function. 

17 . Derive the following identities: 

(a) smh jz = 7 sin c (b) sin>- = 7 sinh c 

(c) coshes = coss (d) cos 7c = — coshc 

(e) tanh jz = 7 tan s (f) tan jz = 7 tanh c 

(g) sinh“^72 = 7 sin“* c (h) sin“^7S = 7 sinh“* z 

(i) cosh“^ s = 7 cos~* s (j) cos\\~^ jz = j cos''^ jz 

(k) tanh“^7S = 7 tan“^ c (1) tan“*7S =7 tanh'^s 

(m) sinh"^ s = In (s -f \ s- -f 1 ) (n) cosh“^ 2 = In (c + \ c“ ~ 1) 

(o) tanh-' : = ^ In (P) coth"' s = In 

(q) sin-* s = -. In (jz + — z-) (r) cos"' s = -. In (c + — 1) 

(3) tan-' z = 1 In (t) cot- = = In 

18 . Considering the function = In 2, draw in the ^4’-plane the figure corresponding 
to the rectangle in the 2-plane defined by 

(.V = - 3 ; 1 ^ y ^ 2 ) (a* = 3 ; 1 ^ y ^ 2 ) 

r- 3 ^.T^ 3 ; y = l) (-3 ^ .x g 3 ; y = 2 ) 

19 . In a complex plane draw the figure bounded by arcs of concentric circles with 
radii ri = 10 and r2 = 30 centimeters, and radial lines making angles of 30 and 60 de- 
grees with respect to the positive real axis. Using the function u' =-- In 2, transform this 
figure into a rectangle and specify the equations of the straight lines forming its sides. 

20 . Considering the function w - sin“^ 2 = 1/4- jv, show that the lines u = 
constant and v = constant in the it’-plane are transformed respectively into central 
hyperbolas 

yL = , 

sin^ u cos‘ u 

and central ellipses 

- ^ , 

cosh^ V sinh^ v 


in the 2-plane, and demonstrate that these families are confocal and orthogonal. Draw 
a representative number of these loci. 

21 . With reference to the conformal map described in the previous problem, con- 
sider the strip — 7r/2 < u < ir/l in the w-plane and determine the regions in the 
2-plane corresponding to the portions of this strip defined by v > 0 and v <0 respec- 
tively. The closed rectangular boundary in the w-plane joining the four points « = 0 , 
v]=^ 0.002; w = 1, z> = 0.002; w = 1, v = 0.02; w = 0, v = 0.02 is traversed counter- 
clockwise. Determine the corresponding contour in the 2-plane and indicate the 
direction of traversal. 
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22. Continuing the study of the function w = sin“* z, describe the structure of its 
Riemann surface pointing out the location of branch points, appropriate positions of 
branch cuts, and the number of leaves in the surface. How many leaves of the Riemann 
surface are occupied by the closed contour in the z-plane corresponding to the rec- 
tangular one in the w-plane joining the four points: 

Sir Sir Stt ^ ^ Stt 

= 1; M=— = = W = - — , = 10 

23. Consider the function w = 1/z and show that it transforms circles in the 

z-plane into circles in the U'-plane and vice versa (including straight lines as limiting 
forms of circles). Draw a family of circles in the u'-plane corresponding to the straight 
lines y = kx -f c (with k and c real) in the z-plane, choosing a fixed value for for 
example, k = J-, and various values for c as, for example, c =0, ±1, ±2, etc. Alter- 
nately consider the families of straight lines x = constant and y = constant. 

24. For the function w = z/ (a — z) with a = 1 -f map the loci in the z-plane 

corresponding to the lines i' = 3^^ + = 3^, — 1 in the a.'-plane. Indicate 

corresponding directions of traversal by placing arrows on the loci. By shading indicate 
the regions in the z-plane corresponding to (a) that part of the 2 i^-plane below the line 
V - 3'2M, (b) that part of the ii;-plane below v = 3^^ and above the real axis. 

25. The arc of a circle of radius 10 and center in the first quarter of the z-plane 

passes through the points x = 0, y = 0 and x = 3, y = —1. Show that the region 

enclosed by this arc and the chord passing through the same two points may be 
mapped in the :t’-plane as an angular slit with its vertex at the origin by means of the 
transformation u* = s/(a ~ z). Determine the complex value of the constant a and 
calculate the angular aperture of the slit as well as its orientation in the li’-plane. 

26. \ region A in the first quadrant of the z-plane is bounded by the arcs of three 
circles passing through the origin, two of them having their centers at the points 
X = 1, y = 0 and x = 2, y = 0, whereas the center of the third circle is at the point 
X = 0, y = 2. A second region B is bounded by the arcs of two circles passing through 
the origin with centers at x = —2, y = 0 and at x - 0, y = — 1. Show that each region 
can be transformed into a rectangular strip in the u^^-plane through the transformation 
w '= 1/z and determine the boundaries of these corresponding rectangular regions. 

27. Show that the equation 

A (x^ + y “) -t- Bx -f Cy -f L = 0 

in which the constants are subjected to the condition 

52 + C2 - AAD > 0 

is a general equation for circles and straight lines, and that this property is preserved 
when z * x is replaced by 1/z. 

28. For the transformation 

1 — z 
w « 

1 

let z = and consider the concentric circles r = constant and the radial lines 
6 =* constant, and compute the location of the center and the radius of each corre- 
sponding circle in the w-plane in terms of r and 6 respectively. Plot a representative 
family of curves in each plane. 

29. For a function w ^ J{z) - u satisfying the Cauchy-Riemann equations, 
loci for u = constant and v - constant are plotted in the z-plane. If the slopes of these 
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curves are denoted by (dy = const ant and {dy t/.r)r -constant respectively, show that 

at any point these derivatives have negative reciprocal values and hence that the 
families of curves intersect at right angles. 

30. Study the function u* = vicinity of the point Zq - .^ 2(0 + b) 

through plotting in this region a number of curves corresponding to u - constant and 
V = constant for a =5 4- jO and h = 0 

31. Let Vz = du dx and : = du dy be the velocity components of a conservative 
hydrodynamic field, u being the real part of the analytic function w = /(z). Show 
that the magnitude of the velocity is given by ? = \df dz\. 

32. Write down the mutual conditions to be satisfied by two functions P(x,y) and 
Q(.v,y) if the line integrals 

J{Pdx-Q dy) and jT (() dx + P dy) 


are to be independent of the path c , and compare the results with the Cauchy-Riemann 
equations. 

33. 5 is a path joining two points .1 and B in the complex s-plane, and Zi, 22 , * * * f 
Zn = B are ;i uniformly spaced points along this path. The definite integral along the 
path is defined as 



z 

A - I 




Az,, 


with Az/. 


-A -i 


Show first that 


and thus that 


. /.• = 1 A- = I 


34. With reference to the situation given in Prob. 33, suppose /(z) is bounded along 
5 and that its largest absolute value on this path is M. Let the length of the path 5 
between A and B be denoted by L. Show that 



^ ML 


35. Let /(z) be regular on and within a closed contour C. The length of the contour 
is £, z is an internal point, and L' is the circumference of a circle whose radius is the 
minimum distance from z to the contour. By means of Cauchy’s integral formula, 
show that 

ML 

|/(z)| ^ ^ {M defined in Prob. 34) 


36. By direct integration along the sides of a rectangular contour joining the points 
; (x -h JCo,y) ; (x + xo,y 4* yo) ; ix.y 4- yo) check the relation 


f 


e^dz =0 
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37. Consider integration of the function e^/z around a closed contour formed by 
two semicircles in the left half plane concentric at the origin, one having a large radius 
i?, the other having a small radius p, and those portions of the imaginary axis joining 
the semicircles. The integral is effectively expressed as the sum of four parts corre- 
sponding to the two linear path increments and the two semicircular ones for which 
z = Rc^^ and z = pc-'^. Observing that the function e^jz is regular on and within this 
contour, obtain in the limit i? , p — » 0, the result 



38. Compute the definite integral 



f 


along a path consisting of the linear increment from the point 1 to the point |z| on the 
positive real axis, followed by a circular increment concentric at the origin, expressing 
the result as the sum of two parts corresponding to these path increments. Thus show 
that the value of the integral equals In z. 

39. Consider integration of the function ti» = around a closed rectangular 
boundary joining the four points z = —< 2 , z = u, s = u -f 7 ^, 2 = —a + jh, a and b 
having positive real values, and express this integral as the sum of four parts corre- 
sponding to the four linear path increments. Using the result 

dx = \ \ TT 



obtain in the limit a 


00 the more general one 



cos 2bx dx 



40. Integrate the function e~'‘ around a closed contour formed by the linear incre- 
ment from 0 to R along the positive real axis, followed by the circular arc from R to 

and completed through a linear increment from to 0. Considering the 

limit R -* X and again using the value of the error integral given in Prob. 39, obtain 
the result 

X 

which, with the substitution x- = u, yields the Fresnel integrals. 

41 . Expand the following functions in Maclaurin series: e^y cos z, sin z, cosh z, sinh z. 

42. Through expansion in Maclaurin series, check the following expressions: 

^2 -3 -4 


COS dx 


= fo = 


(1 + *)" = 1 + nz + ** + 


n(n - \) (ft - 2) 3 


1-2-3 


z" -h • 


43. Using the result that the radius of the convergence circle of the power series 
/(«) = ao 4- diz -f 02Z- H -h GnZ^ 4- • • • 
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is given by 


= limit 

*Lv \iJn\j 


show that R = x for the series in Prob. 41 and R = \ ior those in Prob. 42. 

44. Find the radius of convergence of the following geometric series: 1 

_i_ 2-3 ^ 

45. E.xpand the function (u- — 1)“- in a power series about the points 2=0, 
z = j, z = and find the radius of convergence in each case. 

46. Utilizing the fact that the convergence circle reaches to the nearest singularity, 
compute the radius of convergence of the function [(c — I ) (2 -f 2 — j^)(z‘ -}- 9)]“^ 
e.xpanded in a power scries about each of the points: 2 = 0 , 2 = 0.2 — z = 72 , 
2 = - 8 . 

47. .\ssuming that the function given in Prob. 46 is expanded in power .series about 
points on the line y = — 2 .v lying at .v = 1 . 0 , —1. — 2 . make .sketches showing the 
various regions of convergence and their overlapping portions. 

48. Show that the expansion of the function 


Jo 




about appropriate points zq on the positive real axis reads 

r/ V ^ (Z - 2o)"(-l)” 

and verify that the radius of convergence is i? = |zo — l|. 

49. Using the result 


Jo * 


find the values of the following integrals: 


X’-' 


cos yi dt 


for X >0 


sin yt dt 


50. Show that the function 1/(1 + v^z — 2) has two different power series expan- 
sions about the px)int 2=0 with radius of convergence equal to 1 and 2 respectively. 

51. Using the formula for the coefficient of the Laurent series, and assuming 
that the points f and Zq are not functionally related, obtain the relation 

. -1 db„ 

n + l dzo 

applicable for negative as well as positive integers n. 

52. Consider the rational function 

PM 

in which Pi and P 2 are finite polynomials. Assume that P%{z) has a zero of multiplicity 
Of at 2 = Zo and write Pziz) = (z - 2 o)®Pj(z). Letting f - zo = pe^ in the formula 
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for Laurent coefficients and recognizing that p may become arbitrarily small, obtain 
the result 

' i^2(20) 

Also show that the formula yields 6_(a+p) = 0 for all positive values of p, and using 
the result of Prob. 51, get 

. 1 , 

(a — l)(of — 2) • • * (a — />) ^z^2 

which contains the specific result 

1 

" (a - 1) 

53. Considering the Laurent expansion of the rational function described in Prob. 
52, show that the coefficients /)_« • • • 6_i have real values when Zo is real and that they 
have conjugate complex values for conjugate values of zq. 

54. Show that the Laurent expansion of the function 

f(z) = 

about its essential singularity at s = 0 has coefficients given by the formula 

1 r-' 

bn{t) - -- I COS (}uP — t sin (p) d<f} 

Jo 

which arc Bessel functions of the first kind. 

55. Suppose f(z) has an isolated singularity at s = Zq and the radius of convergence 
for its Laurent expansion about this point is R. For points on a concentric circle about 
So with radius r < R show that the series representation takes the form 

/(s) = .'lo + ^ I (-4,, + *4_n) cos fup “ .4_,0 sin fup} 

n - I 

in which 

^ /(zo + re^‘^)<-' d<t> 

56. Find Laurent expansions for the following rational functions about their poles: 

5s^ + 3s -f- 2 1 3s 

4s“ + 8s + 1 4s- + 8s -h 1 4s- -f 8s + 1 

57. Find the partial fraction expansions of the functions given in Prob. 56. 

58. A polynomial P{z) has a double zero at s = 1, a zero at s = j, and a zero at 
z = 3(1 —j). Assume the coefficient of the term s'* to be unity. Find the Laurent 
series for \/P(z) about s = 1. 

59. Compute the residues of the following functions in their various poles: 

5s^ + s -f- 1 2 * + 

s(z^ -f 3s + 8) s* — s 

60. A rational function /(s) has simple poles at the points s = si, Z 2 , • • • Zn only, 

with the residues respectively. Find analytic expressions for the residues 
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of the function 




j7u) 


— S ) 

in all its poles assuming that f does not coincide with any of the points Z\ • 

61. Continuing Prob. 60. evaluate the contour integral 

lirj , 


•Zn. 


r/ Jr 


in which C is a circle about the origin with a radius R sulViciently large to enclose all 
the tholes of ypiz). li f{z) is regular at infinity, >how that 



0 as R -* CO 


and hence that the value of the above contour integral must be zero. Thus obtain the 
following form for the partial fraction expansion of /(c): 

m =m + + 2 ) 


62. Generalize the result of Prob. 61 to the extent of allowing /(c) to i>ossess an 
unlimited number of isolated simple f'Hiles {it is then no longer a rational function, 
of course). Through choosing the circular contour in >i!ch a way that no jK)le lies ufxin 
this path at any stage in the limiting process indicated by R - ^ . conclude that the 

above partial fraction expansion of / c) yields (in the limit ;/ x) a convergent 
series (theorem of Mittag-Letilcr). Apply the result to verify the following expressions: 



Convert these into summations over positive integers only. 

63. If a rational function has a zero at infinity with a multiplicity of two or more, 
show that the sum of its residues is zero. 

64. Find the values of the contour integrals 



when the contour C is defined as: (a) a circle of radius R < S with center at the origin, 
(b) a circle of radius R > 5 with center at the origin; (c) a triangle with vertices at 
the points — 6,/6, — /6; (d) a triangle with vertices at the points -6,/6,/I0"‘^ 

65. Show that the function 


f(z) = r 

+ (—a + ^ -f c)z^ — {ab ac — bc)z — abc (a- + ab + ac + bc)(z — a) 

has a removable singularity at the point z = a through demonstrating that (2 — a) X 
f(z) is regular in this point. Compute the residues of f{z) in the remaining poles. 
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66. Considering the function 


m = 


1 

. 1 

sin - 
z 


within a circle of radius p about the origin, show that for a nonzero p however small, 
the circle nevertheless encloses an unlimited number of poles. 

67. Starting from the definition of the Tschebyscheff polynomials 

cos («cos“‘ z) 


derive the alternate relation 


Tniz) = 


(z +j\ 1 - z-y + (z -j\ \ - z - 
2" 




and show that these functions are finite polynomials in c. 

68. Find the points of stagnation of the function 

f{z) - t'o ^2 -b ^ -h y — In c 

in which to, R, and C are real constants, and map their locus in the s-plane as the 
quantity C7(47n’o/?) varies continuously from - * through zero to *. 

69. For any function u* = fiz) = u{x,y) + jv(x,y) satisfying the Cauchy-Riemann 
equations, regard M(.v,y) and t(.v,y) as representing altitude functions and thus 
defining a pair of surfaces si and 52- Show that a ma.ximum or minimum in the altitude 
u coincides with a maximum or minimum in the altitude v and with a saddle point of 
the function f{z). 

70. Find the saddle points of the function 

f{z) = 

in which /, u, and h are real constants. Show that they always lie in the real or imagi- 
nary a.\es and that their positions are controlled by the ratio t/a. 

71. Inside a closed contour C’ the single- valued function /(s) is regular and con- 
tinuous except at poles in the points <ii, U2, * * • Un with the residues fti, 62 • • • bn. If L is 
the length of this contour and M is the maximum absolute value of f{z) on the contour 
show that 

bk 
1 

72. Consider the rational function 
T( \ — ^0 ~b <^i2 4- UoS * + • • • + 

^0 + biZ -h ^23^ + • * • + bnZ^ 


< Mh 

“ 27r 


and let ki, • • • kp (p ^ n) be the residues in its poles which may or may not 
be simple. C is a circle about the origin with radius R large enough to enclose all the 
poles. Points on this circle are denoted by 2 = Re^"^. Show: 
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(a) That 


+ki+-- - +k„\^ K J'' |/(z)| d4, 


(b) If « > (;// -h 1) that 


(^1 + ^2 -f * • * + ^p) = 0 


(c) If n = -f 1) that 


(^1 + ^2 + * • • + kp ) = ■ 


How can one obtain an expression for the sum of the residues when n < (w 4- 1 )? 
7.^. Kvaluate the following integrals involving functions of a real variable .v 


r* x‘dx ^ r* dx 

Jo -r' + 1 Jo ? + 1 


through replacing .v by the complex variable z — x 4-.;y and using the methods of 
contour integration, choosing as the closed contour the real axis from —RioR and a 
semicircle of radius R which is regarded as having unlimited magnitude Consider 
whether the semicircle should lie in the upper or lower half plane, and show that the 
contribution of the semicircular path increment to the value of the integral is negligible. 

74. z = is a point on the unit circle about the origin, and yp (cos 6, sin d, is a 
rational function of sin 6 and cos 6 Show that 

jT ^ ^ 


in which /fc) is a rational function of 2 , and C is the unit circle about the origin. 

75. Using the results of Prob. 74, verify the following integration 



dd Itt 

1 + — 2u cos S \(i- — 1| 


76. Within a closed contour C the function fiz) has simple poles at the points 
2i, 22 , • • • 2n with the residues ^ 2 , * * * bn respectively. Inside the same contour the 
function cot ttz has p simple poles. Show that 

\ n »» n 

IT. r /(2) TTzdz = 2 /(^ + ^) + TT 2 bk cot irzk 

2jJc *=1 A:-l 


in which m is an appropriate integer. 

77. Through replacing x by z, choosing a closed coiitour consisting of one large 
semicircle of radius R^ a small one of radius p (both in the upper or lower half plane) 
confluent with portions of the real axis, and showing that contributions due to integra- 
tion along the semicircular path increments vanish in the limits R » and p — > 0, 
obtain the result 


l 


(Inxr 

1 


dx — — 


T 


s 


4 


It is now proposed to find the integral of the same function between the limits 0 and « 
through considering the change of variable x — ► ( — x) in the integration over « to 0, 
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noting that 

[In i-x)y = (Inx ±jTry = (Inac)^ - tt^ ±>27rlnjc 
Thus, after obtaining the collateral results, 


X * dx TT - r* ln.v 

l+x-~2 X 1 +*' 


.dx -d 


show that 


X 




78. By means of complex integration, following a pattern similar to that suggested 
in Prob. 77, check the values of the following definite integrals 

X * COS.V ^ ire-^ sin .r , xe"® . , 

— dx = I dx = smh a 

X- + a- 2a Jo x- + a- 2a 


Express cos s and sin z in terms of the exponential function and correspondingly 
represent each integral as a sum of components. Choose composite contours consisting 
of the real axis and a semicircle of radius ♦ x , placing the latter in the upper or 
lower half plane according to the requirement that this path increment shall contribute 
nothing to the value of the contour integral. Note that the second of the two integrals 
above remains proper for t/ — » 0 If the constant a is regarded as real throughout the 
process of evaluation, is the result nevertheless still valid for any complex a-value, 
for example, for a pure imaginary value? 

79. If fiz) is a rational function having a zero at infinity with a multiplicity equal 
to or greater than one, and if Fit) is a real function of the real variable t which is zero 
for / < 0 and has the property that — 0 for / — » x (assuming z to have a 

positive real part), the following pair of mutual integral relations hold: 

Pit) = r f{^)e^^dz f{z) = f F{t)e~‘^dt 
-X/ %J~ J w J—oo 

Check these relations for the following pairs of functions: 


= (. +"a)- 

F(/) - re-‘“ 


F(() = sin at 


F(t) = cos at 

r/ X 2 + a 

{z + ay+y- 

F(t) = cos bt 


The first of the two integrals is to be evaluated using complex integration along a 
closed contour consisting of the imaginary axis and a large semicircle of radius R » 
lying in the left or right half plane according to whether / > 0 or / < 0 (see Art. 26 of 
Ch. VH). The second integral in the above pair is to be evaluated according to the 
methods of real integration. 

80. Expand the following rational functions in partial fractions and check the 
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relations regarding the residues as stated in parts {h) and (c) of Prob. 72: 


(a) 

+ ’2 - 1) 

(b) 

(2= - 2)(2- - 2 - 1) 

(c) 

2" + ’2 - 1 

(d) 

(2 - 3)^2 + 8) 


(s — 3)"(s -f tS) 

1 

{Z - lYiz - 3)5 


81. Discuss the structure of the Riemann surface associated with each of the 
following functions: 


(a) w = 


(c) TV = \ \z — a){z — b)- 



(d) It* = \ (z — 5)(s“ +62 -j- 13) 


pointing out the location and order of branch points and other singularities as well as 
the character of the point 2 = x . 

82. Discuss the Riemann surfaces in both the u- and 2-planes asstKiated with the 
function li* = z^ 

83. For the function il’ = consider the leaves of the Riemann surface defined 
by the statements: 

—X < arg. 2 < X corresponds to leaf I. 

X < arg. 2 < 3x corresponds to leaf II. 

3x < arg. 2 < 5x corresponds to leaf III. 

In the u’-plane plot loci corresponding to the following straight lines in the 2-plane: 

(a) Parallel to the real a.xis at distances O.I and —0.1 from the origin and Iving in 
leaf I. 

(b) Parallel to the imaginary axis at the distance 0.1 from the origin and Iving in 
leaf III. 

(c) Parallel to the imaginary axis at the distance —0.1 from the origin and lying 
in leaves I and II. 

84. For each of the following functions w = f(z) construct the algebraic equation 
F{z,w) = 0, w'hich generates the complete function, and from this equation determine 
the inverse function z = 

(a) w - Vz^ — 1 (h) w - z A- Vz^ -h 1 (c) w = ^2 

Describe in each case the structure of fhe Riemann surfaces in the w- and z-planes, 
pointing out the locations and multiplicities of branch points, branch cuts, etc. 

85. Recognizing that each of the complete functions w = y/ z and IF = v z — 1 

has two branches, construct the algebraic equation that generates the function U - 
w Determine this function as well as its inverse z - 4>{U)j and discuss com- 

pletely the Riemann surfaces in the U- and z-planes. 

86 . The expressions 

w - w = \/l — sin^ z 

represent pairs of single-valued functions rather than multivalued functions. Demon- 
strate the truth of this statement through showing that the two functions of a pair 
cannot be obtained one from the other by the process of analytic continuation. 
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87. Through the use of Hilbert transforms solve the following: 

(a) given u(0,y) = find KO.y) 

(b) given «(0,y) = 1° ^ find i'(0,y) 


(c) given m(0,v) 


1 1 + ^- for —yi <y <0 
i Vl 

) V 

; 1 — ' for 0 < y < find i'(0,y) 

1^0 for ,v, > Vl 


(d) given M(0,y) 


(e) given miO.v) 


' (f) given r(0,y) 


>'i 


for — Vl < y < 0 


Vl 


for 0 < y < yi 
fur y > yi 


find i-c0,yj 


1 1 ^v -f yi) 5 

‘ 2 + ■ — 2 — for -yi - - < y < -yi + 

1 (v - Vl) 8 8 

2 - ' ^ ' for Vl - ^ < y < yi H- - 

1 1 for |y| < yi - ^ 


0 for |yl > yi + 


' - - V for -Vl < V < yi 
!>’! 

i —/ITT for y < — yi 
\fnr for y > yi 


find w(0,y) 


find t’(0,y) 


<o 1 



CHAPTER VII 


Fourier Series and Integrals 

1. Finite trigonometric polynomials 

In discussing the convergence of Fourier series it is necessary to have 
compact expressions for their })artial sums. For this reason, and also 
because one hnds expressions of this sort useful in \ arious other problems 
having to do with trigonometric series, a number of formulas are de- 
veloped whereby a variety of finite trigonometric polynomials are given 
in closed form. 

In view of the well-known geometric series 

— = l+z + z^ + z^ + --- [1] 


one may write 


z" 1 - z 


1 I ^Ti t-.'i 


= 1 + s + + . . . + 2-' 


- 1 + S~' + s“- + • ■ • + Z-” 


By addition and subtraction respectively these equations yield 
1 

(1 + c-) = 2 fl + -liz + s-‘) + l{z^ + 2-2) + • • • + 


|-(z" + 2 -")) [5] 


( 1 - 2 -") = 2 {|( 2 - 2 -‘) + ^( 22 - 2 - 2 ) + ... + [ 6 ] 


Letting 


and noting that 


z = *= cos X +j sin X 

gti _ ^jnx _ ^ j gJjj 
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one finds for example, that 


1 - 2”-^^ 
1 — Z 


2(n+0/2 __ „-{n+l)l 2 

(1 + 2 -") .;2 „- l /2 ( 2 ”'* + 2 “”'") 

Z dj 

X X 

2 sin {n 1 ) - cos n - 


sm 


[9] 


By use of well-known trigonometric identities, this may further be 
transformed as shown by 


XX X 

2 sin (« -f 1) ' cos n - sin {2n + 0 2 


. X 

sm- 


+ 1 


[ 10 ] 


The use of manipulations of this sort enables one to obtain from Eqs. 
5 and 6 the formulas 


j sin (2« + 

q — = 1 -f cos A* -f cos 2 jc + • • • + cos nx [11] 


and 

1 

cot sin x + sin 2.r + • • • + sin nx [12] 

A ^ ^ 

2 sin 2 

The compact forms thus obtained for the finite trigonometric polynomials 
given by the right-hand sides of Eqs. 11 and 12 are useful in a variety of 
practical problems as well as in connection with various theoretical dis- 
cussions regarding Fourier series. Other formulas are readily obtained 
from Eqs. 11 and 12. For example, replacing the variable x in Eq. 11 
by {x + t) yields 

J (-1)" cos (2« + 1) ^ 

- -1 1 ~ COS a; -t- cos 2a: — cos 3 a; -f • • • 

2 ^ X 

2cos- 

+ ( — 1)"cos>m: [13] 

which is the same polynomial but with alternating signs. Making the samp 
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change of variable in Eq. 12 gives 


^ (- 1 )"-' sin 1 )*^ 

1 .V 2 

tan H 

2 2 , .V 


= sin X — sin 2 .v -}- sin 3.r — • • • -f- 


(-1)” * sin ;/.v [14] 

Subtracting Eq. 13 for ;/ odd from E(|. 1 1 yields a formula for a cosine 
polynomial with only odd integer multiples of .v, thus: 

sin {n + l).v . . . _ 

— - — : = cos .V -h cos 3x -f- cos v*).!* -[-••• + cos ;/.v 1 ^ 

2 sin .V 

Similarly, adding Eq. 12 and Eq. 14 for ;/ odd yields 
1 - cos (« + n.r . 


2 sin X 


sin .V -f sin 3.r -f- sin 5.v -f • • • + sin nx [16] 


in which sin“ (« -h 1 ) ^ may be written instead of .>[1 — cos (n -}- 1 ).v] 
if desired. 

. TT 

In Eq. 15 one may introduce the change of variable .v— ►.v -f ^ 

obtain a sine polynomial with odd multiples of .v and alternating signs, 
thus: 


f-l)(n-i >/2 sin (n + 1 )a* 


2 cos X 


= sin X — sin 3x + sin 5x 


+ sin ux [17] 


Making the same change of variable in Eq. 16 yields 

1 + ( — _j_ 

= cos x — cos 3x -f cos XX — • • • 

2 cos x 

4 . cos nx [18] 


In these last two transformations it is significant to note that Eqs. 15 
and 16 apply only to odd integer values of w, so that (n -f 1 ) is an even 

integer. Hence sin (w -f 1) ^ = 0; whence 

cos (w + 1 ) 4 - 0 = cos (w + 1 )x cos (» + 1 — sin (w + 1 )x sin (^+ 0 ^ 

= ( — cos (»+l)x = — ( — cos {n+\)x [19] 
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and 

sin {n + \)(^x+^ = sin (n + l)a;cos 4-cos {n +l)jcsin (w + 1)^ 

= (~l)(«+»)/^sin (n4-l)x= - (- sin {n+l)x [20] 


2. The orthogonality relations and their significance in the 

EXPANSION OF ARBITRARY FUNCTIONS 


The trigonometric functions, in common with many other kinds of 
so-called systems of proper functions, possess a very interesting and 
imix)rtant property which greatly facilitates the process of represent- 
ing an arbitrary function over a given interval by a series in terms 
of proper functions. Since the underlying principles of this process have 
a broad significance in the solution to problems in potential theory 
and wave motion, a few rather general introductory^ remarks are in 
order. 

For detailed discussion of the derivation of the following fundamental 
dilTerential equations and for physical interpretation of them, the reader 
is referred to other portions of this reference series, the chief interest 
at the moment being focused primarily upon their purely mathematical 
import. In dealing with problems in potential theory, in which a desired 
potential function is a function of the space co-ordinates {x,y,z) alone, 
Laplace’s equation 

= 0 [ 21 ] 


is found to govern the behavior of that funct'on, whereas in problems 
involving wave motion, in which the time co ordinate also is involved, 
this equation is modified by the appearance of an additional term so that 
it reads 


dt" 


[ 22 ] 


In the first of these two cases the potential is static, whereas in the 
second it is dynamic — that is, it varies with the time as well as with the 
space co-ordinates. In other words, a problem in wave motion is simply 
a problem in potential theory with an added time co-ordinate. 

The form of the Laplacian operator depends upon the particular 
system of space co-ordinates used.* In the commonest of these, the 


♦See Art. 18, Ch. V. 
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ordinary rectangular Cartesian system, it is given by 

a- d'^ a- 


[23] 


In dealing with the so-called ivave equation 22, the first step is to 
eliminate the time variable. This is done through assuming 


r) = 


[24] 


whence 


a/“ 






[25] 


so that Eq. 22, after cancellation of the common factor reads 

V-V -h = 0 [26] 

in whicli* 



In Eq. 26, as in Eq. 21, the function T is a function of the space co- 
ordinates only. 

The particular form of these equations now depends upon the system 
of co-ordinates used, and the choice of co-ordinates in turn depends upon 
the geometry of the physical system to which the equations apply. For 
a rectangular geometry the ordinary Cartesian co-ordinates are used; 
for a cylindrical geometry, cylindrical co-ordinates; for a spherical geome- 
try, spherical co-ordinates; and so on. Correspondingly, these particular 
forms of the equations are known by certain names, such as Bessel’s 
equation (for cylindrical co-ordinates) or Legendre's equation (for 
spherical co-ordinates), and the particular types of functions which 
formally satisfy the equations are largely known by names which relate 
to the particular geometry of the physical system, such as cylinder 
functions (of which the Bessel functions are the so-called first kind) 
or spherical harmonics (also known as Legendre polynomials), and so on. 
These functions are referred to as the proper functions pertaining to the 
particular physical system under consideration, the simplest of them being 
the trigonometric functions which are the proper functions for systems 
having a rectangular geometry. In a single dimension this type of geome- 
try is that of a straight line, like a stretched violin string, for example. 

In terms of some parameter, like parameter n in sin nx and cos nx 

♦The values ilf2, * * * , which usually are infinite in number, are determined from condi- 
tions which the function is required to satisfy at certain physical boundaries. 

The details of this process need not be considered at the moment. 
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or, in the case of solutions to the wave equation, in terms of the so- 
called proper values these functions form a set or system. In view of 
the linearity of the equations, it follows that a complete formal solution 
is given by a linear superposition of a set of these proper functions with 
different parameter values and arbitrary coefficients. Thus if </>n(^,y,2) 
represents a proper function for the parameter index n, the solution has 
the form 

V = + • • • [28] 

which in general is an infinite series. The coefficients On are regarded as 
constants of integration which give the formal solution 28 the necessary 
flexibility of meeting certain boundary' conditions set by the physical 
problem. 

Thus in a two-dimensional problem in static potential theory, for 
example, the potential T)(.v,y) may have to become identical with a 
certain function /(.v) for y equal to a particular value, say y = 0, wrhich 
characterizes a physical boundary. Or, in a problem of w'ave motion, the 
function T)(.v,y,c,/) may, for the temporal boundary' / = 0, for which 
(according to Eq. 24) T) = r(.v,y, 2 ), have to meet some prescribed 
function. For example, a stretched membrane for which T) represents 
the displacement of various points from an equilibrium position is given 
a particular deformation from wiiich it is suddenly released at an instant 
w’hich is designated as / = 0, or an electrical transmission line (which 
involves a single space co-ordinate) has, at a given initial instant, dis- 
tributed upon it certain charges w'hich give rise to a specific potential 
function versus distance along the line. These are referred to as initial 
or boundary distributions. Inasmuch as they may, in a physical system, 
be arbitrarily specified, the process of solution must be able to fit a series 
of the form given by Eq. 28 to a specified function of one or more of the 
independent variables. The fulfillment of these conditions may neces- 
sitate the selection of certain kinds of proper functions any of w'hich 
formally satisfy the differential equations, such as the selection, for 
example, of a particular kind of cylinder function in the solution to a 
problem with cylindrical symmetry. 

In one dimension this problem takes the form 

f{x) = ai<pi{x) 4- a 2 <b 2 {x) + a^4)2{x) + • • • [29] 

in which f{x) and the functions 4>n{x) are known but the coefficients 
are to be determined so as to satisfy this equation. The problem here 
presented is determining the expansion of an arbitrary function /(a:) in 
a series of specified proper functions in such a way that the resulting 
series is in general a convergent one. 

The solution to this problem is either impossible, or possible only 
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through the use of ingenious artifices, unless the system of proper func- 
tions (or a derived system formed from linear combinations of these 
functions) satisfies the so-called conditions of orthogonality, which in the 
one-dimensional case are expressed by the equations 


J r^ / \ / \ j for m = « 

I <lhn{x)(Pn{x) dx = \ 

a I 0 for w n 


[30] 


in which a and b are the finite limits of the region over which the function 
f{x) is specified. The quantity is ordinarily a constant and can be made 
equal to unity by the incorporation of an appropriate scale factor. This 
process is called normalization, and the resulting functions 0n(-^‘) are then 
spoken of as a normalized set of orthogonal proper fu notions or an ortho- 
normal set. 

The use of the term “ orthogonality in connection with the relations 
30 is suggested by the parallelism between these conditions and those 
characterizing an orthogonal matrix.* Since the relationship between the 
present problem and that presented by a set of linear algebraic equations, 
which is thus implied, has rather more than superficial significance, 
this item is discussed in greater detail. 

Regarding the partial sum 

Sn(x) = ^101 (a:) + a24>2{x) + • • • + Un<^n(A‘) [31] 

corresponding to the series 29, and assuming that the function Sn{x) 
is specified over the range a ^ x ^ b, one finds that a possible procedure 

for determining the coefficients Ok such that the finite p)olynomial in 

terms of the functions approximates the given Sn{x) is to divide 

the inter\"al a to 6 into n equal subinterv’als whose midpoints are the 
ic-values expressed hy a < Xi < X 2 <•••< Xn < b, and then to write 
the set of n equations 

Sn{Xi) = + ^ 202 ( a ^ i ) + • * • + Ctn4>n(,Xi) 

SnM = ai<l>i{X2) + a2<h2{X2) H + Cln4>n{X2) 


Sn{Xn) = (Il4>l{Xn) + (l2^2{Xrf) + • • * + €tn<l>n(Xn) 

in which the n coefficients Ok are regarded as unknowns. The values of 
the coefficients thus determined, when substituted into Eq. 31, yield a 
function Sn{x) which has the correct values at the selected points Xi • • • 
within the prescribed interval. At any other points, not much can be 
said about the degree of approximation afforded by the finite sum in 
Eq. 31. However, by taking n sufficiently large, and assuming for the 
moment that Sn (x) is a smooth function, one may expect a solution which 
fulfills the requirements of a given physical problem. Indeed, as n is 

*See Ch. H, Art. 6, and Ch. m, Art. 4. 
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chosen larger and larger, one may expect a closer and closer approxima- 
tion to the partial sum Sn{x), which is ultimately identified with the 
specified function fix). 

Such a process of solution requires, for a finite w, the inversion of the 
matrix 


01 (a:i) 

02 (^*1 ) * 

• <i>n(Xi) 

<t>l{X2) 

02U*2) * 

4>ni^2) 

.<t>l (Xn) 

4>2(Xn) * ‘ 

• ■ <t>n(x„)_ 


[33] 


which in general is a laborious task, and becomes hopeless if not impossible 
as n is increased without limit. However, if this matrix is an orthogonal 
one, its inverse is simply given by the transpose of the matrix 33. The 
solution for any « is then immediately written down. 

The orthogonality conditions for the matrix 33 read* 


n 


"L <l>r(Xk)<t>,{Xk) 
k=l 


J 1 for r = 5 
to (or r 7^ s 


[34] 


which are readily recognized as having the same form as the relations 30. 
The solutions to Eqs. 32 are then given by 

n 

ds ^ ^ 4>siXk)SniXk) [^ 5 ] 

This result may be obtained through multiplying Eqs. 32 successively 
by 0 «(.vi), 0 «(.V 2 ), • • • <t>s{Xn)j and adding. In view of the conditions 34, 
the sums of all the columns vanish except that of the .yth, which yields 
a*, whereas the sum of the left-hand members ’ • seen to be given by the 
right-hand side of Eq. 35. 

The equivalent solution for the coefficients in the infinite series 29 is 
obtained through multiplying this scries by <f>six) and integrating over 
the region a ^ x ^ b. Then if the conditions 30 hold with = 1, there 
results 


— j* ^ 

which represents the desired solution. 

In order to derive the orthogonality conditions for the trigonometric 

•These are the orthogonality conditions for the columns only. .The corresponding ones for 
the rows are not needed for the present discussion. Indeed, it is actually not necessary for 
the matrix to be orthogonal since the existence of orthogonality conditions for only its columns 
is sufficient. 
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functions it is convenient to write them in the exponential form 
sin nx = ) 


cosw.r'= - 4 - 


whence 


: sin nx = - - ^-juu+njx _ 


sin mx cos nx = 7-. — e ^ 

4/ 


[39] 
n [40] 


cos mx cos wa; = + e -h + <» 


Next it is observed that 


X a4-2r 

dx = — 

- 


^yA*x-|«+2» ^ka^^jk2T _ jj 


Since = 1 for all integer values of k, the numerator in this result is 
zero for all )^-values. The denominator is not zero except for ^ = 0. 
Hence the value of the integral in Eq. 42 vanishes for all ^-values except 
= 0, when the right-hand side of this equation assumes an indeterminate 
form. The latter is readily evaluated if the limit A — » 0 is considered to 
proceed gradually. For small values of k the exponential may be 
replaced by a few terms of its Maclaurin series, giving 

+ 7*27r + 1) , 

1 Ti 


Thus it becomes clear that 




Iw lOT k = 0 
0 ioT k 7 ^ 0 


This is an important property of the exponential function. 

By means of the expressions 39, 40, and 41 it is now readily seen that 

r +2» , . [± 7 r forW= 

sm mx sin wo; ax = i . . ^ [45] 

I 0 for w 5*^ dbw . 


0 for w 5*^ dbw 


r 4’2x 

sin mx cos nxdx ^ 0 for all m and n 

r + 2 » . f+» for m = zkt, 

cos war cos M* 0* = 1 - , 

I 0 iorm9^±t 


for m = ±« 9^ 0 
for m 96 ±n 
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The fundamental range ^ so-called, which in the preceding discussion of 
this article is indicated by a x ^ is in the integrals 44, 45, 46, and 47 
seen to be any region throughout which the variable x changes by an 
increment of Iv. The limits on these integrals are, therefore, arbitrary 
except that they must differ by 2 t (or a multiple of Iw). 

By the same process of analysis it is also readily found that 


/: 


G+Dt 


e^^^ dx = 


jk 


0 

TT 


for k odd 

for k even except 
for ^ = 0 


[48] 


in which v is any integer. In other words, if the fundamental range has a 
“width ” of only v instead of 27r, a result similar to the one expressed by 
Eq. 44 exists only for k even, and then only when the lower limit on the 
integral is an integer multiple (including zero) of tt. 

If it is now observed that, for any integers m and «, {m + n) and 
(m — n) are either both even or both odd, the result 48 together with the 
relations 39, and 41 shows that 


I 


>(»+!)» 


sin mx sin nx dx = 


and 




>+!)» 


cos mx cos nx dx = 


TT 

"=2 

for m = zLn ^ 0 

[49] 

0 

for m 9 ^ zkn 


^-2 

for 7)1 = 9 ^ 0 

[50] 

0 

ior m 9 ^ zb n 


In other words, the trigonometric 


functions sin nx and cos nx also form orthogonal sets over a fundamental 
range which is only v units wide. However, it is to be observed that a 
relation similar to the one expressed by Eq. 46 does not hold for a range 
of this width. 

If the range 0 < .r < x is divided into n equal subranges, and the 
values Xi, X 2 ,- ' • Xn refer to the centers of these subranges, as shown in 
Figs. 1 and 2 for w = 8, then matrices corresponding to the matrix 33 
may be formed for the sine and cosine functions. 

Thus it is found that 


Xu = 


{2k - l)x 
2n 


[ 51 ] 


SO that if 


aus = COSSXu 


[52] 
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Fig. 1. Division of sine into sub- Fk;. 2. Division of cosine into sub- 
ranges appropriate to the formation ranges appropriate to the formation 

of an orthogonal matrix. of an orthogonal matrix. 



m 

[ 54 ] 

[ 55 ] 

[ 56 ] 

[ 57 ] 
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These may be put into a form in which they more nearly simulate the 
expressions 49 and 50 through noting, with reference to Figs. 1 and 2, that 


TT 

Ajc = - 
n 


Hence 


21 OtkrOtks Ax = { 
k=l 


and 


^krPks = 


TT . 

2 loT r — dzs except 
IT for r =5 = 0 

0 iorr 9^ ±s orr = ±:{s = n) 


rb - forr = ±5 except 

db TT for r = it (5 = n) 

0 for r ±5 or r = 5 = 0 


[58] 


[59] 


[60] 


These, or the expressions 56 and 57, may be verified independently. 
Thus 

otkrCtkB = cos rXk cos 5Xit = §|cos (r - s)xk + cos (r + s)xk} [61] 
and 

^kr^ks = sin rxk sin sxk = |icos (r - s)xk - cos (r + 5).V]t) [62] 

Now by Eq. 51 

^ , N ^ f(2^ — l)(r — j)7rl 

2 cos (r - s)xk = L cos 

*=1 A;=1 I 2 n J 

(r - s)Tr 3(r - s)Tr {2n - l)(r - s)Tr 

= cos h cos f- • • • + CO- [63] 


2n 


2n 


271 


This expression is the same as the right-hand side of Eq. 15 of the pre- 
vious article except that ~ A takes the place of x, and {2n — 1) 

takes the place of Hence it is found that 

sin (r — 5)7r __ ! 71 foTr = s 


2 cos (r - s)xk - . . 

ib-i ^ . (r - s)Tr 

2 sin 

271 


0 for r 5 ^ 5 except 
[ —71 for r = —(s = n) 


Similarly 

^ / . \ sin (r -f s)Tr 

E cos (r + 5)a:jfc = = < 

Jt-i „ . (>^ + 


2 sin 


2n 


71 forr = —s 
0 for r 7 *^ — ^ except 
[ —n for r = (5 = n) 


[64] 


[ 65 ] 
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The last two relations, together with those stated by Eqs. 61 and 62, 
verify the conditions given by Eqs. 56 and 57. 

If the functions aka and (Iks arc normalized through division by 
\n;2, it is observed that they fulfill conditions of the form given by 
Eq. 34 except for r = 5 = n. This means that if the coefficients in the 
partial sum 31 are approximately evaluated by means of the formula 35, 
a correction must be applied for the coefficient t/„. It should be observed 
in this connection, however, that the values of the coefficients ak obtained 
by such a method become more and more approximate the nearer the 
index k approaches n. As pointed out previously, the partial sum Sn{x) 
converges toward the desired function /(.v) only as n is increased without 
limit. For a sufficiently large finite «, the partial sum 5„(.v) may be 
regarded as an approximation to f{x) which is close enough for certain 
practical purposes, but the degree of the approxiniation is then very 
likely little affected if one entirely ignores the last term, or perhaps 
several terms in this vicinity of the partial sum 5„(.v). This item is further 
discussed in Art. 15. 

It may also be observed that the matrices 54 and 55, with the elements 
as defined by Eqs. 52 and 53, are actually not orthogonal matrices since 
the orthogonality conditions are fulfilled only for their columns, and not 
for their rows. As far as the present problem is concerned, however, 
this situation is not significant since none of the reasoning given in the 
present article is thereby affected. 


3. The fourier series 

The previous article indicates how a representation in the form of an 
infinite trigonometric series may be found for a given function f{x) 
specified over a finite interval a ^ x ^ a 2 t. Incidentally, this desig- 
nation for the interval may be regarded as perfectly general inasmuch 
as any other, such as a < jr < a -f f , is converted to it by means of the 
x( 

change of variable a: — > — • 

2ir 

The trigonometric series, or Fourier series, as it is called, has the form 

/(^) = ^ + dicosx + 02 cos 2x + fla cos 3x -f- • • • r 

2 [66J 

-f bi sin X + b 2 sin 2x + is sin 3x + • 

The necessity of having both sine and cosine terms is readily recognized. 
Writing the series 66 in the more compact form 


/(^) = "T + 2^ OnCosnx + 2^ bn sin nx 

2 n-l n-1 


[ 67 ] 



Art. 3] 


THE FOURIER SERIES 


449 


multiplying both sides by cos wx, and integrating over the fundamental 
range, one has 


r +2r 

f (jr) cos mx ~ J 


+ 


f 2 COS nx cos mx dx 

“ n =lj 

/^a+2ir • 

+ / ^ bn sin nx cos mx rfx 

«/a n =1 


[68] 


The first integral on the right-hand side is zero except for m = 0. Exclud- 
ing this value, and assuming for the moment that the series 66 or 67 
converges uniformly over the fundamental range a ^ x ^ a It, so 
that the integration of the infinite sums may be carried out term by 
term, one finds 


r + 2r • pi 

f (x) cos mx dx — ^ an I 

n =1 

+ i r 

n = 1 t/a 


*a+2T 


COS nx cos mx dx 

•a4-2T 

sin nx cos mx dx 


[69] 


In view of the conditions expressed by Eqs. 46 and 47, one obtains 

1 pa+2x 

<'m = - i fix) COS mx dx [70] 

V «/a 


This formula evaluates the coefficients of the cosine series, and inci- 
dentally gives the coefficient Qq correctly also, in uch as the ortho- 
gonality conditions 46 and 47 applied to the term-^c( ^nn integration in 
Eq. 68 yields for m = 0 




X 


*a+2r 


dx = aoTT 


[71] 


If Eq. 67 is multiplied by sin mx and integrated term by term over 
the fundamental range, it being recognized that the term with Oq then 
yields zero even for m = 0, one obtains 


r +2r • pa 

/(x) sin mx dx = Z) / 

n si «/o 

+ 


+ 2 » 


COS fix sin mx dx 


- pa+2w 

T, bn I sin nx sin mx dx 

n si t/tt 


[72] 


Here the use of the orthogonality conditions 45 and 46 gives 

I pa+2T 

bm = - I fix) sin mx dx 


[ 73 ] 
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which is a formula for the coefficients of the sine series in Eq. 66 or 67. 

A convenient form for the formulas 70 and 71 is obtained if the funda- 
mental range is specified as — tt ^ .v ^ tt, as can always be accomplished 
by a suitable definition for the independent variable. Then, n being 
wTitten in place of tn to provide a more evident consistency with the form 
of the series as stated by Eq. 67, the formulas read* 

1 

On = - I f(x) cos ;/.v dXj for ;z = 0, 1, 2, • • • [74] 


and 

= - f f{x) sin Hx dx, for /z = 1, 2, 3, • • • [75] 

It is now significant to observe that if the specified function f(x) is 
even, that is, if 

f(-x)=f{x) [76] 

the integrand in the integral for bn is an odd function of x, and the 
integral between the s\Tnmetrical limits — tt and tt vanishes. In other 
words, all the coefficients bn are then zero, so that the function /(.r) is 
represented by a cosine series alone. This should, of course, be expected 
for an even function f{x) since the cosine functions are even and the sine 
functions odd. 

On the other hand if f(x), with regard to the fundamental range, 
— IT ^ X ^ TT, is an odd function of x, that is, if 

f{-x) = -fix) [77] 

the integrand in the integral for a n is an odd function, and this integral 
vanishes for the limits — tt to tt. All the coefficients of the cosine terms 
are then zero, and the function f(x) is represented by a sine series alone. 

This state of affairs suggests that in general one may write 

/W =/i(^) +f2ix) [78] 

in which fi(x) is an even function and f 2 (x) an odd one. That such a 
representation for the function /(x) should always be possible is clear 
from the fact that if x in Eq. 78 is replaced by — x, and the relations 
76 and 77 are observed with regard to the even and odd functions /i(x) 
and f 2 (x) respectively, there results 

fi-x) =fiix) -fiix) [79] 

*It may not be superfluous to point out that the derivation of these formulas for the 
Fourier coefficients in no way proves that the series can actually be used to represent a periodic 
function. The conditions under which such a representation is possible and a discussion of 
the convergence of the series are given in Arts. 7, 8, 12, 13, 14. 
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Addition and subtraction of Eqs. 78 and 79 then yield 

hix) = +f{-x)\ [80] 

and 

J2{x) = tl/fx) [81] 

Hence the even and odd components fi{x) and f 2 (x) of an arbitrary 
function fix) can always be uniquely determined either analytically 
or graphically. 

Substituting the representation 78 for fix) into the integrals of Eqs. 
74 and 75, and taking note of the odd and even character of the functions 
fzix) and fiix) respectively, one finds that 

1 

= / /i (-v) COS nx dx, for ;; = 0, 1, 2, • • • [82] 


and 


1 T' 

= / f'zi^) sin nx dx, for 7t = 1, 2, 3, 

TT «/— »■ 


[83] 


In other words, the coefficients of the cosine terms in the series 66 are 
determined from the even component of fix) alone, and the coefficients 
of the sine terms are determined from the odd component alone. Inas- 
much as an arbitrary function fix) must in general contain both even 
and odd components according to the decomposition expressed by Eq. 
78, it follows that the trigonometric series represent ing/(.r) must contain 
both sine and cosine terms. When it is assumed, then, that the even and 
odd functions respectively are completely represented by cosine and sine 
series, it follows that the form of the assumed series representation for 
fix) as given by Eq. 66 is also sufficient. Thi- question, as well as the 
manner in which the trigonometric series approximates the function /(jc) 
as the number of terms in its partial sum is increased, is discussed in the 
subsequent articles. 

In the meantime it may be useful to obser\’e that the integrands in 
both the integrals 82 and 83 are even functions of x. Consequently the 
same value is obtained if the integration is extended only over the range 
zero to IT and the result is multiplied by two. This gives the alternative 
formulas 

dn = - I fi ix) COS nx dx, for w = 0, 1, 2, • • • [84] 

IT Jo 

and 

f 2 ix) sin nx dx, for n = 1, 2, 3, • • • [85] 
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In the discussion so far, it has been assumed that the function /(a:) is 
defined only within the interval a ^ x ^ a + 2 ^^ that is, that the 
function does not necessarily exist also for values of x not in the stated 
range. For example, if/(.r) represents the initial distribution of voltage 
on a transmission line and the interval a ^ Jt* ^ a + 27 r corresponds, by 
reason of a suitable change of variable, to the length of that line, it is 
manifestly clear that an inquiry regarding the values of this function 
beyond the limits of its fundamental range has no sense. From a practical 
point of view such an inquiry does not arise since one is content with a 
solution valid over the extent of the physical system. 

It is, on the other hand, sensible to inquire, for the moment out of 
pure curiosity perhaps, what kind of function the trigonometric series 
represents when all restrictions on the independent variable are removed, 
in other words, if Ji* is allowed to take on all values "from — to <» . In 
view of the periodic character of the trigonometric functions, the answer 
to this question is ob\ious. The trigonometric series represents a periodic 
function with the period 2x, such that the behavior of the function 
throughout any one period matches that of the function f{x) over its 
fundamental range. 

It is thus clear that the trigonometric series in Eq. 66 is capable of 
representing an arbitrary periodic function over the entire range of its 
independent variable from minus to plus infinity. Such functions occur 
quite frequently in engineering analysis, the independent variable 
usually being the time L The periodicity of the function is expressed by 
the relation 

/« + M =/(0 [ 86 ] 

in which r is the period (sometimes referred to as the fundamental period) 
and k is any positive or negative integer. The reciprocal of t is the 
fundamental frequency 

and 

„ = 2 »/ = — [ 88 ] 

T 

is the fundamental angular frequency or radian frequency. 

The Fourier series representation for the periodic function /(/) is 
written 


/(/) ^ + ^2 cos + • • • 


+ hi sin (at -|- 62 sin -f- . . . 


[ 89 ] 
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in which, according to Eqs. 74 and 75, 


dn 



cos YUsii dt, 


for » = 0, 1, 2, • • • 


and 


hn 



sin wo)/ d/, 


for n = 1, 2, 3, • • • 


[90] 


[91] 


4. Thp: phase angles of the harmonic components 


In view of the trigonometric identities 

Cn cos {fUat + (t>n) = Cn sin ^ 

s Cn COS 4>n COS 7io)t — Cn sln 0n sin fUat [92] 


it is possible to combine the sine and cosine series in Eqs. 66 or 89 through 
letting 


dn = Cn cos 4>n 
bfi = Cn sin <f>n 


[93] 


whence 


Cn = Va„* + 


P4] 


• The Fourier series can then be written as a sum of sine or cosine terms 
alone, for example, as 

/(/) ~ 4“ 4- <t>i) + C 2 cos + <^ 2 ) + * • • [95] 


The term having the fundamental angular frequency is commonly 
called the fufidamental component of the periodic function /(/), and the 
remaining terms, whose frequencies are integer multiples of the funda- 
mental, are referred to as harmonics. The coefficients Cn and the angles 
which are determined by Eqs. 94 together with Eqs. 90 and 91, are known 
as the harmonic amplitudes and phase angles. 

Because of the periodic nature of the function /(/) it is usually possible 
to select the origin for the independent variable t at any convenient point. 
In subsequent manipulations it may, nevertheless, be desirable to shift 
the origin to a new location. Thus, if the variable t in /(/) is replaced by 
(/ — to)y the effect is to shift the origin back by an amount equal to to 
(this shift amounts to retarding the sequence of values of the function 
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by /o)- Graphically, this change of variable corresponds to shifting the plot 
of the function by an amount /q. A typical harmonic component 
becomes 

Cn cos [«<*>(/- /o) -f <^n] = Cn COS [woj/ + (</>„ - Woj/o)] [96] 

This result may e\ddently be interpreted as a change in the harmonic 
phase angle to the new value 

f lirntn 

n = <i>n — = 0n [97] 

T 

It is significant to note here that each harmonic phase angle is changed 
by an increment proportional to the order n of the corresponding har- 
monic component. Such a prop)ortionate change in the harmonic phase 
angles, therefore, leaves the resultant form of the function /(/) unchanged 
except for a translation of the function as a whole. 


5. E\tn and odd harmonics 

It frequently occurs that the given perodic function satisfies the 
condition 

!)=-/(/) [98] 

which means that the sequence of values of the function throughout 
any half p>eriod are the negatives of the values encountered throughout 
the preceding or succeeding half period. In order to observe the effect of 
this condition upon the form of the resulting Fourier series, one may write 
the expressions 90 and 91 for the coefficients in the form 

a„ = ^ j[ ' [/(/) cos cos nw dl [99] 


b„ = ^ [/(O sin tkol +f(^t - 0 sin nu(^t - 0] dt [100] 


These ate readily recognized as being equivalent to Eqs. 90 and 91 
through noting, for example, that 


I fiO cos Mw< dt = — 0 cos n<a — 0 dt [101] 


because the second integral is obtained from the first when one makes the 
substitution t-*t — rjl and appropriately changes the limits of Integra- 
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tion. Then, observing that m^rll ^ mr, and hence that 



cosnojh 

5 — - 1 = cos nir cos not 



\ 

/ 

t\ 


[102] 


sin 1 i 

1 — -y = cos mr sin not 


one finds that the condition 

98 substituted into Eqs. 99 and 100 

is seen 

to yield 






03 

TT 

cos Wtt) J 

nr ,'u) 

' f(t) COS not dt 

D 

[103] 

and 






03 

cos nir) J 

nr/u 



6n = - (1 - 
IT 

f{t) sin not dt 

[104] 


The factor (1 — cos ;z7r) is zero for all even integers of n and equal to 
2 for all odd integer values of n. Hence it is clear that when the periodic 
function has the property expressed by Eq. 98, its Fourier series contains 
odd harmonics only. The average value or constant component a^/l is 
evidently zero also. The coefficients of the odd harmonics are then given 


by 

2o 

an = — 1 

nrf<a 

f f{t) COS not dt 

[105] 

and 

TT o 

TT J 

0 

nxita 

^ / (/) sin not dt 

[106] 


These considerations may suggest the question of what the results are 
when the function /(/) satisfies a relation c6mplementar>" to that ex- 
pressed by Eq. 98, namely, when 

/(/±0=+/(O [107] 

Although recognizing that the factor (1 — cos mr) in Eqs. 103 and 104 is 
changed to (1 + cos mr) gives the answer a moment^s reflection reveals 
that the condition 107 merely states that /(/) has the period r/2 instead 
of T. If T is retained as the fundamental period, the result is that the 
Fourier series contains only even harmonics, which is just another way 
of saying that the period of /(/) is actually half as large. 

On the basis of these thoughts it appears that any periodic function 
may be assumed to consist of two components, of which one satisfies the 
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condition 98 and the other has twice the fundamental frequency. This 
decomposition may be indicated by 



/(/) = 

fliO +/ii(/) 

[108] 

in which 



[109] 

and 

/ii zfc — /ii(/) 

It then follows that 

[110] 


■^v * 2 ' 

1 =/iU) - fiiU) ' 

[HI] 

and hence. 

adding and subtracting Eqs. 108 and 111, that 



/i(0 = \ 

i/(0 +/^/ ± 2)1 

[112] 


/n(0 = 

-H 

^ ^ 

1 

[113] 


This decomposition is similar in analytic form to the decomposition 
into even and odd components as stated by Eqs. 80 and 81, but should 

not be confused with the latter. The 
decomposition into even and odd 
components according to Eqs. 80 and 
81 is a decomposition of the given 
function into components which 



are respectively symmetrical and 
antisymmetrical about the ordinates 
at = 0 or / = 0, and hence yields 
a decomposition of the Fourier series 
into its component cosine and sine 
series, each of which in general con- 
tains both even and odd harmonics, 
and hence can be further decomposed according to the relations 112 and 
113. 


Fig. 3. Decomposition of a saw-tooth 
wave into even and odd hannonics 


The component functions given by Eqs. 112 and 113, on the other 
hand, may individually be either even or odd but in general are neither, 
and hence can also be further decomposed. The decomposition indicated 
by Eq. 108 is such that the first component contributes the even har- 
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monic components to the resulting Fourier series, and the second con- 
tributes only the odd harmonics. An interesting example of this decom- 
position is illustrated for the saw-tooth wave shown in part (a) of Fig. 3. 
Here the rectangular wave (shown dotted) represents the odd harmonic 
compx)nents, and the even harmonics are due to the saw-tooth wave of 
double frequency, part (b), which remains after the rectangular com- 
ponent is subtracted from the original saw-tooth wave. 

6. Alternative Fourier expansions for a function defined 

OVER A FINITE RANGE 

The preceding articles show that there is an infinite variety of ways to 
establish a Fourier series representation for a function which is defined 
over a finite region, if the sole object is to obtain a trigonometric series 
which yields the correct values of the stated function over this finite 
range only. One way is to consider the defining range of the given function 
as the fundamental period, as is done in Art. 3. However, since this range 
may alternatively be considered as only a part of the fundamental 
period, and since the definition of the given function over the remainder 
of this period as well as the extent of the period are entirely arbitrary", 
it is clear that any number of trigonometric scries representations may be 
found. All of them yield the same values over that portion of the period 
which corresponds to the original defining range, although they may show 
a variety of beha\iors throughout the remainder of each period. 

It should be observ’ed that if the determination of a Fourier expansion 
appears as part of the process of solving a boundary value problem, such 
a variety of alternative possible procedures does 
Xiot exist, because the conditions of the problem 
permit the choice of only one set of proper 
functions in terms of which the expansion may 
be made. There are, however, many other ways 
in which Fourier expansions enter into engi- 
neering analysis, and in most of them the ob- 
ject of using the series representation is solely 
to get an approximating function in the form of a 
trigonometric series for some given function whose behavior is specified 
over a finite interval. Inasmuch as the engineer must, for practical 
reasons, limit the expansion to a finite number of terms, and the computa- 
tional labor as well as other economic factors dictates that this finite 
number shall be as small as possible, the freedom of choice mentioned 
in the preceding paragraph must be recognized as an important con- 
sideration in the selection of a suitable process of analysis. 

As a simple illustration of how a variety of series representations may 


^ f(x)=ax 



Fig. 4. A function de- 
fined over a limited range. 
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be obtained for a given function defined over a finite range, it is interest- 
ing to consider the function illustrated in Fig. 4 for which the defining 
range is the region 0 ^ Xi. Parts (a) to (e) of Fig. 5 show several 


.'I 

1 

sines and cosines 

^ 

1 

1 

1 ^ • 

1 -*^1 I 1 

1 '4 

odd harmonics only 

(a) 

sines only 

^ .-"1 _,-"i 

1 

0 i 1 

W-* 1 

even and odd harmonics 

(b) 

cosines only 

^ ^ 1 • 


^ Ti 1 

1 ,^ 

odd harmonics only 

(c) 

sines only 


0 

odd harmonics only 

(d) 

' 

/I 

0 i, 

good approximation with a 
single sine term 

(e) 


Fig. 5. Various possible periodic continuations of the function of Fig. 4. 

ways in which this function may be assumed to be continued beyond the 
defining range so as to yield a periodic function. In each case the behavior 
over the defining range is the same as that given in Fig. 4, but the series 
representations for the individual cases are nevertheless quite different, 
as indicated on each sketch. 
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It is also significant that the rate of convergence of the resulting Fourier 
series may be quite different for the different forms of periodic functions. 
As is also pointed out in a subsequent article, the convergence is in general 
more rapid for a smooth function than it is for one which varies er- 
ratically or has large first or higher derivatives. For example, the series 
for a function which has discontinuities, like those sketched in parts 
(a) to (c) of Fig. 5, in general converges rather slowly, so that a large 
number of terms must be calculated in order to establish a fairly good 
representation for the function. If the latter is continuous, as it is in 
parts (d) and (e) of the figure, the convergence is considerably more 
rapid. The function shown in part (e) has a more rapidly converging 
scries than the function in part (d) whose first derivative has discon- 
tinuities. In fact, the function in part (e) is approximated very well by a 
single sine term. 


7. The Fourier series as a special form of the Laurent 
expansion; the complex Fourier series 

The Fnuricr series may alternatively be obtained from the Laurent 
expansion discussed in Art. 12 of Ch. VI. According to Eqs. 163 and 166 
of that chapter, the e.xpansion has the form 


/(s) = L a„(s - 2o)" 


[114] 


in which the coefficients are given by the 
formula 




IwjJ.'i (s'- - =o)"+' 


[115] 



The region of uniform convergence lies 
between the circles ri and rz indicated in 
Fig. 6, for which it is assumed that ri < 1 
< To. The expansion 114 then converges 
uniformly for all points on the unit circle 
drawn with Zq as a center. As discussed in 
the previous chapter, this statement as- 
sumes, of course, that the function is reg- 
ular and continuous at all points within the annular region enclosed 
by the circles with radii ri and rz. No restriction is implied by the 
assumption that ri < 1 < rz, since this condition may in any case be 
obtained by means of an obvious change of variable, should it not be met 
in the first place. 


Fig. 6. Region of uniform 
convergence of a Laurent series 
from which the Fourier series 
may be obtained by an appro- 
priate change of variable. 
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If the point z in the expansion 114 is assumed to refer to any point on 
the unit circle, and the latter is also chosen as the path of integration S 
in the formula 115, one may write 

II 

.? 

1 

[116] 

and 


f - So = 

[117] 

whence 


- j d>P 

i — •'O 

[118] 

The expansion 114 then takes the form 


m= f ' 

n = — « 

[119] 

in which, according to Eqs. 115, 11 7, and 1 18, 


1 rn+2rf(^)j 1 

[120] 

Since the point z is restricted to lie on a circle about So, f(z) evidently 
reduces to a function of the real variable </>. In order to conform with 
more familiar conventions, the symbol (f> may be replaced by Jt* and the 
variable of integration yp by so that Eqs. 119 and 120 take the form 

fix) = i ane’”^ 

n = — » 

[121] 

and 


1 ra-r2w . 

[122] 

Although the function f(x) is real, it should be observed that the 
coefficients an are complex. The series representation for the function 
f(x), as expressed by Eq. 121, is known as the complex Fourier series. 
It may readily be shown to be entirely equivalent to the real form in 
terms of sines and cosines. The first step in this regard is the observation 
that, according to Eq. 122, the coefficient an is replaced by its conjugate 
value if « is replaced by — that is, 

O-n = «n 

[123] 


This result is clear from the fact that changing the sign of n in Eq. 158 
is equivalent to changing the sign of j. The series 121 may be written 
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out in the form 



f(.r\ - 4- f + • ■ • 

J{,X) ao + ^ ^ ^ . . 

[124] 

from which it is seen that the terms appear in pairs of conjugates. A 
typical conjugate pair reads 

an 

e’’“‘ + 

[125] 

Now if one writes 

i 

On - jbn 
= 2 

[126] 

then, according to Eq. 123, 



a. 

On jbn 

2 

[127] 

The pair of terms given in Eq 

. 125 then becomes 


+ OL-ne~ 

= dn cos nx + bn sin nx 

[128] 

whereas the separation of Eq. 

122 into its real and imaginary parts yields 

1 

On = - 

TT 

J /({) cos </{ 

[129] 

and 



bn=- 

TT 

pa+2r 

/(?) sin 

[130] 


Since ^0 = 0, and hence «o = y , Eq. 128 shows that the series 124 is 
equivalent to 

/(.r) = Y + cos X a 2 cos 2.v + • • • 

+ bi sin X + b 2 sin 2x + • • • [131] 

which is the familiar form for the Fourier series in terms of its sine and 
cosine components. The formulas 129 and 130 are seen to be identical 
with those given by Eqs. 70 and 73. The equivalence between the com- 
plex form 121 and the ordinary form for the Fourier series is thus estab- 
lished, and the formula 115 for the coefficients of the Laurent expansion 
is seen to contain the formulas for the Fourier coefficients as a special case. 

The complex form for the Fourier series is more convenient than the 
ordinary form for many manipulations because of its relative compact- 
ness as contrasted with the sine and cosine form, and also because of the 
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greater ease with which the exponential function may be manipulated 
with regard to various algebraic as well as differential and integral oper- 
ations. In this connection it may be obsers’ed, incidentally, that the 
complex form requires only the one formula 122 for the evaluation of 
its coefficients as contrasted to the two formulas, 129 and 130, which are 
ordinarily needed. 

The relation between the complex coefficient and the real coefficients 
an and bn is further illustrated by the sketch in Fig. 7 which shows a 

pair of conjugate complex coefficients 
an and a_„. These may be regarded 
as the conjugate complex terms 
and for a: = 0. As the 

variable x increases from the value 
zero, the vectors which these terms 
represent rotate in opposite directions 
at an angular rate of ft radians per 
unit of X. They, therefore, remain 
conjugates for all values of x, so that 
their vector sum becomes a real, 
simple harmonic function of x. For 
X = 0)1 this pair of conjugate terms 
is thus seen to represent a simple har- 
monic function of the time / with an angular frequency of tio) radians 
per second. 

Denoting the angle of the complex coefficient an by as indicated 
in Fig. 7, that is, writing 

<Xn = \(Xn\ a-n = \oCn\ [132] 

one finds that the expression for this simple harmonic function is given by 

= 2\an\ cos {nx + <t>n) [133] 

Equations 126 and 132 give 

dn -jbn = 2|ani ^OS 4>n +y2|a„| sffi [134] 



Fig. 7. A pair of conjugate complex 
coefficients of the complex Fourier 
series. 


or separating reals and imaginaries this yields 

flyi = 2 1 0(71 1 cos 071 

bn = ~2la,»| sin 0,7 


[135] 


Comparison with Eqs. 93 and 95, shows that the resulting harmonic 
amplitude Cn is equal to twice the magnitude of the complex Fourier 
coefficient whereas the harmonic phase angle is the angle 4>n of the 
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complex coefficient an- These results are also evident from inspection 
of Fig. 7. 

The complex form for the Fourier series is thus seen to contain the 
phase angles of its various harmonic components by virtue of the complex 
character of the complex coefficients. The formula 122 for these coeffi- 
cients yields these harmonic phase angles as well as the corresponding 
amplitudes 2|an], and the complex form of the series, as expressed by 
Eq. 121, contains both the amplitudes and the phase angles although the 
latter do not appear explicitly. This fact, namely that the harmonic 
phase angles are implicitly contained in the complex form 121, and hence 
need not be explicitly written down during any manipulations which 
may be carried out with the series, is one of its greatest labor-saving 
virtues when the Fourier series is used in various analytic formulations. 
This advantage becomes particularly apparent from the discussions given 
in Art. 9. 

Inasmuch as the basis for the Laurent expansion requires that the 
function fiz) be regular and continuous throughout the annular region 
enclosed by the circle with radii ri and ro in Fig. 6, one should expect 
that the validity of the Fourier series would be similarly restricted. 
Although this restriction, in general, holds if one expects uniform con- 
vergence of the series for all values of its independent variable, it is found 
nevertheless that the conditions for the possibility of obtaining a Fourier 
series representation for a given function are somewhat less confining if 
certain reservations are accepted regarding the convergence of the result- 
ing series. 

The conditions, known as the Dirichlct conditions., under which a 
Fourier series representation for a given function is possible, state that 
throughout the fundamental range for which the function is defined, it 
shall possess a finite number of maxima and minima, and in spite of having 
a finite number of discontinuities and points vn here the function becomes 
infinite, it shall possess an absolutely convergent integral; that is,* 

jT [136] 

shall be finite. If, according to the latter part of this statement, the func- 
tion becomes infinite at some point, this infinity shall be integrable, for 
example, like a logarithmic infinity. (It should be recalled that the area 

*Thc absolute convergence is required only if the integral ^ f{x) dx becomes improper. 

Actually, the conditions set down by Dirichlet are more stringent than those just stated, and 
do not permit the function ;(a:) to become unbounded. The conditions stated here are, never- 
theless, designated by man. writers as the Dirichlet conditions, and inasmuch as it is a con- 
venient way of referring to iher., this somewhat inaccurate practice is used here also. 
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under a logarithmic infinity is finite.) The series converges uniformly for 
all values of the independent variable except those correspx)nding to 
points of discontinuity, and at the points corresponding to infinite values 
for the function the series can, of course, no longer converge. 

At a point of discontinuity, which may be denoted by jr = Xi, the 
Fourier series yields the average of the two values of the function imme- 
diately adjacent to this discontinuity, that is, the series yields the value 

f{x, - 0) +/(.r, + 0) 

2 

The reason for this property of the series is discussed in Arts. 12 and \3. 
Together with a number of other interesting items, it is illustrated by 
some of the examples of the next article. 


-2t Oj 2jr 4ir Sir 

Fig. 8. The saw-tooth wave is basic in the description of discontinuities. 



[137] 


8. Several illustrative examples; a criterion regarding 

THE RATE OF CONVERGENCE 

The first example to be considered is shown in Fig. 8. This function 
h{x) has discontinuities of unit magnitude which occur at the origin and 
at integer multiples of 2ir. It is similar to the saw-tooth wave shown in 
Fig. 3 except that it is turned over and raised so as to lie upon the x-axis. 
This function may be defined by the statements 

Hx) = ;;; — for — ir < ai; < 0 

[138] 

= z + - for ft < « < T 

Z Ltc 


It is observed, therefore, that except for the constant component the 
fimction is odd. Hence its Fourier series is given by plus a sine series 
whose coefficients are evaluated by the formula 85. These coefficients are, 
therefore, given by 



— x) sin nx dr 


[139] 
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The integration yields 


so that the resulting Fourier series becomes 

,, , 1 , Ifsina: . sin2x , sm3* , 1 

4W-2 + ;L-r + — + — + •••] 

At the points jc = 0, 2ir, 4 t, • • • — 27r, — 47r, • • • etc., the value of the 
series is 3^^, which is the arithmetic mean between the values of h{x) 
immediately adjacent to these points of discontinuity as stated by the 
expression 137. For the immediate vicinity of these points the series 
cannot be said to converge uniformly because it yields different values, 
depending upon the direction from which these points are approached, 
and hence the partial sums do not converge toward a definite limit in this 
vicinity as n is indefinitely increased. 

In order to investigate the convergence for other values of x one may 
regard the partial sum 

, . Ifsino; sin2jc . . sinwrrl . 

>.(.) -;[— + — + ■■• + —] ['«! 

and form 
|5n+t(:«:) - •yn(a;)| 

1 sin {n + l)a: , sin (m + 2)x sin (« + k)x 

= ; n + 2 ■ + ••■+ n + k ^143] 


sin (« + ^)a:| 


n k 


Letting 


<Tn = sin x -f sin 2 jc + • • • + sin nx 


one may write the expression 143 

kn+*(a:) - J„(a;)| 

_ A gn-fl ~ g'n , g‘n4-2 
T « + 1 n + 2 


+ ...+ 


O'n+fc 1 

n + k 


The right-hand side of this equation may be written alternatively in the 
form 

^ nTT («+ 1 “ «n) (» + 2 “ w+l) ■'■ ■ ■ ■ 

+ („ +1 1' 1 - ;r^) + 
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Now, according to Eq. 12, 

X X 

cos - - cos (2« + 1) » 

1 [147] 

2 sin- 


Although, for various values of n and x, this sum <rn may have a variety 
of positive or negative values (in particular for x = 0 or a multiple of tt 
it has the value zero), it is clear that a finite positive quantity S may be 
found which the absolute value of On cannot exceed for any x and any n, 
no matter how large the value of the latter may be chosen. In other 
words, it is possible to specify that 

lo-nl < 5 for any x or n values [148] 


in which 5 is finite. 

Observing that the coefficients of the partial sums 
in the expression 146 are positive, and that an may have a numerically 
negative value while the remaining partial sums are at the same time all 
positive, one sees that, even in these most unfavorable circumstances, this 
expression nevertheless cannot have a value in excess of 


1 _1 LrJ! L. _ 1 1 

w+l”^w+l n-\-2 n+2 n-\-k 

Hence it is established that 

2S 

l5„+*(a;) - j„(.r)| < , , for all n and k 


S 

T 


2S 

ir(«+l) 

[149] 


[150] 


Consequently the statement that 

|in+t(iic) — s„(a:)| <€ independently of a; [151] 

for all values of k and all values n > N may be justified through choosing 


or 


25 


T{n 4 - 1 ) 


< € 


(»+!)> — 

ire 


[152] 

[153] 


which means 


ire 


[154] 


According to Cauchy’s principle of convergence (Eq. 94, Art. 9, Ch. VI), 
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therefore, the Fourier series 141 for the function illustrated in Fig. 8 is 
seen to converge uniformly over any range of :r-values which excludes 
the points of discontinuity. 

The results obtained for this simple example enable one to draw 
conclusions regarding the convergence of Fourier series for more arbitrary 
functions possessing a finite number of discontinuities. As a first step 
toward evolving such a generalization, a function / (x) may be assumed 
to have a single discontinuity equal to the value h at some point x = 
in the fundamental range a g x g a -f 2 t. In other words, the function 
/(.r) is continuous throughout this range except for a sudden jump 6 
(positive or negative) in its value at the one point x = x„. 

In terms of the function //(x) of Fig. 8, one may form a function 
h • //(x — x*,) which is a saw-tooth wave having a jump equal to 5 at the 
|)oint X = Xj. in the fundamental range. If this function is subtracted 
from /(x), the resulting function 

F(x) = /(x) - 5//(x - X,) [155] 

must be continuous throughout the fundamental range a ^ x ^ a + 2 t. 

With this result tucked away in one’s mind for future reference, at- 
tention is for the moment turned tow^ard the formula 122 for the complex 
Fourier coefficients, which is repeated here for the convenience of the 
reader 

1 /M+2ir 

«n = ^jf [156] 


Here one may apply the principle of integration by parts, for which the 
well-known formula reads 



J*u dv = uv -- J*v dv 

[157] 

Letting 


u =/(f) and dv = e ^^^d^ 

[158] 

one has 

du-f^H()d( and v - — 

[159] 


in which the superscKpt (1) on the function /(f) indicates that its first 
derivative is meant. Subsequently, the superscripts (2), (3), etc., are 
used to denote the second and higher order derivatives. 

Substitution into the formula 157 yields for the coefficient an of Eq. 156 


^^-|a+2x 

2vnj Ja 



di 


[ 160 ] 


an — 
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However, 

tt-f2» p—Jna 

= T— •!/(« + 2t) -/( a)} =0 [161] 

a 2TrflJ 

By repeatedly applying the same process, one may obtain the formula 

which, of course, is a proper integral only so long as the kth derivative 
(f ) remains finite, although it may be discontinuous. 

It may now be supposed that the given function /(x) and all its suc- 
cessive derivatives ••• are continuous, but 

that the i^th derivative P^\x) possesses the discontinuity 5 at a: = Xy, 
Then, according to the argument leading to Eq. 155, the function 

F^^^(x) = — 5 • h(x — Xv) [163] 

is still continuous, and hence for it the procedure leading to the integral 
162 can be continued at least one step further. Hence the Fourier co- 
efficients an for the function F^^^{x) must decrease in magnitude for 
large values of n at least as rapidly as the ratio 1/ Inasmuch as the 

illustrative example at the beginning of this article shows, however, that 
the Fourier coefficients for the function h{x — x^,) decrease with in- 
creasingly large n only as fast as the ratio l/«, it follows that those for 
the function can do no better although they are certain to do as 
well. These coefficients are given by 

1 /»o+2» 

^ jf di [164] 



and hence the coefficients for the function /(a;) are, according to Eq. 162, 
expressible as 


«n = 


«n 


(k) 


(»* 


[165] 


One may, therefore, conclude that for a function f{x) which, together 
with all its successive derivatives up to that of the ^th order, is continuous 
throughout the fundamental range, and for which, therefore, the Ath 
derivative is the first one possessing a discontinuity within this range, 
the Fourier coefficients are found to decrease in magnitude for large 
values of n as rapidly as the ratio 

Furthermore, one may conclude that, inasmuch as the Fourier series 
for A(x — x^) converges unifonnly over any region whose boundaries 
exclude the points x, =fc an integer number of 2 t^s, the series for the 
function (x) also converges unifonnly over the same ranges. In the 
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immediate vicinity of a point of discontinuity this series no longer con- 
verges uniformly, but its partial sums {x) in absolute value evidently 
remain smaller than some finite upper bound like the value S for the 
partial sum 144 or its closed form 147. Recognizing that the Fourier 
coefficients for the function (x) are those for the function {x) 
multiplied by \/jn, and following a procedure identical to that used in 
the investigation of the convergence of h{x), one is led to the conclusion 
that the series for/*“*^(jc) converges uniformly even over a range which 
includes the discontinuity of The series for the derivatives of 

lower order, and so on down to that for the function /(:r), then certainly 
do likewise. 

An important result of this line of thought is the fact that if a given 
function f{x) is itself continuous but possesses a discontinuity in its first 
derivative at some point the Fourier series for / (a*) still converges 
uniformly at the point .v = It may, therefore, unquestionably be 
integrated term by term, and it may also be differentiated term by term, 
although the point of discontinuity must then be excluded from the 
region of uniform convergence. 

Each time the Fourier series is integrated term by term, an additional 
factor n is introduced into the denominator of the expression for 
and each time it is differentiated, a factor of n is canceled out of the 
denominator of this expression. It is readily seen, therefore, that integra- 
tion improves the rate of convergence of the Fourier series, whereas 
differentiation makes the series converge more slowly. This fact may have 
been expected in the first place since integrating a function makes it 
smoother whereas differentiating it accentuates any existing irregularities. 

It is a simple matter to extend these conclusions to include functions 
having a larger number of discontinuities, as long as this number remains 
finite. For example, if the function /(x*) has the discontinuities 5i, 62 , 

• • • at the points Xi, X 2 , • • • x„ within the fundamental range, the func- 
tion 

= /(x) - 2 - X,) [166] 

i = 1 

is continuous throughout this range, whereas the Fourier series for that 
part of /(x) represented by 

i: iih{x - X.) [167] 

is given by the sum of v series, each of which has the same general form 
as that for h{x). The Fourier series for /(x), therefore, converges uniformly 
over ranges whose boundaries exclude the finite number of discontinuities, 
and the coefficients decrease in magnitude with large values of n as fast 
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as and no faster than the ratio 1/w. The series may be integrated term 
by term, yielding a series which converges uniformly throughout the 
fundamental range, but it cannot be differentiated term by term since 
the resulting series then no longer converges at all. 



Fig. 9. Two saw-tooth waves add up to yield a square wave. 

The second example to be considered is the rectangular wave with 
identical positive and negative half cycles of unit magnitude as shown 
in Fig. 9. This function has a discontinuity equal to 2 at the beginning of 
its fundamental range 0 ^ x ^ 2 t, and another equal to ~2 at the 
center of this range. The saw-tooth functions representing these two 
discontinuities are, therefore, given by 

2h(x) and — 2//(x‘ — tt) [lb8] 

After subtracting these from the given function in this example, one 
finds that there is nothing left, that is, 


f{x) ~ 2h{x) + 2h(x — tt) = 0 [169] 

Hence, according to Eq. 141, the Fourier series is given by 


. 2 fsin X sin 2x 1 

1 +-2 + ■ J 


2 r sin (x — tt) sin 2(x — w) 

~ irl 1 2 ■ 

] [170] 

With 


sin n(x — r) = ( — 1)" sin mx 

[171] 

this result yields for the Fourier series of the square 
Fig. 9 

wave shown in 

. 4 fsin a; , sin 3x , sin 5x , ' 

+ 3 + 5 +■■■. 

j [172] 


As one should expect from the fact that f(x) is odd, the representation 
is given by a sine series alone, and since the function satisfies the condi- 
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tion 98, only the odd harmonics are present. When the origin is shifted to 
the point x = 7r/2, the function becomes an even one and the series is 
converted into a cosine series. This shift is accomplished by replacing 
X in Eq. 172 by of -f 7r/2 , thus 

^ [i«i 

in which it is to be observed that the algebraic signs alternate. It may 
also be noted from Eq. 172 that, at the points of discontinuity of the 
function, the series yields the value zero which again is the algebraic mean 
between the immediately adjacent values of j{x). 




cos X 

~T” 


cos Zx 


+ 



J(X) 

' ... 

Msm 




^ rn 


1 

1 f 


Fig. 10. .V triangular wave which results when the rectangular wave of Fig. 9 is 

integrated. 

Integrating the series 172 term by term, one obtains the Fourier series 
representation for the triangular wave shown in Fig. 10 which is recog- 
nized as the result of integrating the square w^ave of Fig. 9. Thus, for 
the function of Fig, 10, 

. 4rcosa: . cos 3a: cos 5a: , 1 

+ _ + _ + [ 174 ] 

A point which may be a bit puzzling here is the fact that no constant 
term appears in the expression 174, whereas i+’ one mentally visualizes 
the function which represents the area under the square wave of Fig. 9, 
starting the integration at a* = 0, this function evidently turns out to be 
a triangular wave lying upon the x-axis, that is, having the form of the 
wave shown in Fig, 10 but with an additive constant component equal 
to 7r/2. The reason that this constant component is missing from the 
representation 174 is, of course, the fact that this result is the ifidefinite 
integral of the series 172 and not the integral from zero to some variable 
point x. Incidentally, the definite integral from tc/2 to any variable 
p)oint X has no constant component, as is readily recognized from inspec- 
tion of the Rg. 9. At all events one must recognize that the indefinite 
integral can yield only what is customarily referred to as the alternating 
component of the resulting function, and this in the present example is 
the triangular wave of Fig. 10. 
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It should be particularly noted that the series 174 converges consider- 
ably more rapidly than the series 172 or 173. Thus the coefficients in 
the series 174 vary as l/w“ whereas those in the series 172 vary only 
as 1/n. This observation agrees with what should be expected from the 
preceding discussion inasmuch as the triangular-wave function is con- 
tinuous whereas its first derivative, the square-wave function, possesses 
discontinuities. 

9. The Fourier spectrum 

The interpretations given in the present article are most conunonly 
used in connection with functions which are periodic with respect to 
the time /. The essential manipulations are, moreover, most effectively 
carried out in terms of the complex form of the Fourier series, which is 
expressed by Eq. 121 together with the formula 122. For x = ot these 
expressions are usually wTitten 

/(/)= i 

n = — ao 

with 

a„ = ^ [176] 

Zir c/— » ot 

Here it is useful to regard the general coefficient as a function of 
the variable mu, which is the angular frequency of the harmonic com- 
ponent of order n. Since n assumes only integer values, the variable no) 
is a discontinuous one, and the function an, which may alternatively be 
denoted as a (mu), has values only for discrete values of its independent 
variable. Nevertheless, it is useful to regard a(n<o) as a function rather 
than merely as denoting the values of the Fourier coefficients, since this 
view leads to an interesting and useful interpretation of the relations 
175 and 176. 

The variable / is regarded as the independent variable in a certain 
region designated as the time domain^ whereas the variable mu represents 
an independent variable in a corresponding region known as the frequency 
domain. The function which is given as /(/) represents a specification of 
some desired function in the time domain, and the corresponding func- 
tion a(mu), or an, found from the integral Eq. 176, is regarded as the 
specification of that same function in the frequency domain. In other 
words, the function a (mu) is in every respect just as complete and specific 
a representation of the desired function as is /(/), the only difference 
being that a (mu) represents that function in a different domain, namely 
that domain in which mu instead of / is the independent variable. 


[175] 
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This view that the function a(nw) is entirely and uniquely the equiva- 
lent of the function /(/) is indisputably tenable since the relation 175 
uniquely converts the function a{ma) into the function /(/), whereas the 
relation 176 does the reverse. In other words, these two expressions are 
a pair oi mutually inverse relations in the sense that one of them undoes 
what the other does. This circumstance may be placed even more clearly 
in evidence through substituting the expression 176 for an into 175, 
which operation incidentally requires writing for the variable of integra- 
tion / in Eq. 176 some other symbol w so as to avoid confusion with the 
independent variable t in the expression 175, thus 

m = £ f r'f /(«)«>’•“<'-“> du [177] 

n at 90 ZTT »■ /« 

By means of this expression, in which the variables / and u both refer 
to the time domain, the given function /(/) is expressed in terms of itself. 

The pair of relations 175 and 176 may, in the light of this interpreta- 
tion, be regarded as transformations which accompany the change of 
varial)le from t to ww or \ice versa. For this reason the function a{nw) is 
sometimes spoken of as the Fourier transjorjn of /(/), and the latter as the 
inverse transform of a(no)). In their present form the transformations 175 
and 176 are restricted by the condition that the function /(/) must be a 
periodic function defined over the entire time domain — oo < / < oo . 
The removal of this restriction and the accompanying modification in the 
Fourier transforms arc discussed in Art. 19. 

Just as the function /(/) may be represented graphically by being 
plotted in the time domain, so the equivalent function a{no}) may be 
plotted in the frequency domain. The latter domain is more commonly 
known as the frequency spectrum and the plot of a{fio)) as the spectrum 
representation of the function /(/). 

Inasmuch as the function a (//<*>) is in genen.l complex, it is necessary 
to make two plots in order to represent it completely. The ones usually 
chosen are the magnitude la„| and the angle 0„, although the real and 
imaginary parts could also be chosen. Plots of |an| and <t>n are called 
respectively the amplitude and phase spectra of the given function /(O- 
Since ma assumes only discrete values, these plots are not in the form of 
continuous curves but consist merely of a series of vertical lines repre- 
senting the ordinates of the functions \an\ and corresponding to integer 
values of n. For this reason they are referred to as line spectra. 

The square-wave function of Fig. 9, for example, yields, according to 
the formula 176, 


[ 178 ] 
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or, the change of variable / / — t/cj being made in the first integral, 




Integrating and substituting limits give 

(1 - c^'"")(l - (1 - cos tiTc) 


lirnj 


mj 


Hence 


f 0 for » even 
2 


jTTft 


for n odd 


[179] 


[180] 


[181] 


Substituting this result into the sum 175 and combining the conjugate 
terms yield the series 


m = 


L-i 


i«» 


which may be written 


and 


Lit sin 3a)/ sin 5a)/ 1 

[182] 


+ 5 +--J 

= 0 )/. The 

amplitude and phase functions are 

2 

m 

(for n odd) 

[183] 

- 

T tua 

(for n odd) 

[184] 

4>n = • 

T 

“2 

[185] 


Since the phase function is a constant, there is no need to plot it. The 
plot of the amplitude function 183 or 184 is facilitated through plotting 
first a continuous dotted curve for this function, assuming woj to be a 
continuous variable, and then erecting a set of ordinates from the tua-ajds 
to this dotted curve at the points corresponding to odd integer values of 
w, as is illustrated in Fig. 1 1 . The dotted curve in this example is simply 
a rectangular hyperbola. After this dotted curve is drawn, any number of 
lines are readily inserted. A plot of this sort gives one a good idea of the 
rate of convergence of the series, since the ordinates are proportional to 
the values of the various harmonic amplitudes. 

A second interesting example is given by the function shown in Fig. 12. 
This consists of a periodic succession of identical rectangular pulses of 
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unit amplitude and duration 5, with the time origin chosen at the center of 
one of them. Application of the formula 176 in this case yields 

h 

ncj- 

r-^ [186] 


-2T«yJ 


— 5/2 


sin 



Fig. 11. The amplitude function in the Fourier analysis of a rectangular wave. 


which is preferably written 


a 

sin - 
00} L 


Sin - 

a 2 






a 

no,- 


[187] 


Smce the expression is real, the phase function in this example is zero, as 
is to be expected from the fact that the function / (/) is even. 



Fig. 12. A periodic succession of rectangular pulses. 


The amplitude function has the form (sin %)lx which is equal to unity 
at x = 0 and zero at integer multiples of ir. The general appearance of 
this function is that of a harmonic oscillation of decreasing amplitude, 
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although it is, of course, not simple harmonic. At the point x tt/I the 
value is 2 /t; at jc = 3t/ 2 it is (1/3)(2 /t); at x = 57r/2 it is (l/5)(2/ir), 
and so forth, the maxima and minima lying approximately at the points 
X = 3ir/2j 5t 2, • • • . With these items in mind a dotted curve showing 
the function 187 versus ww is readily plotted. It remains to draw in the 
ordinates corresponding to integer values of w. Before this can be done 
one must choose a particular value for the ratio of the duration 8 of the 
impulse to the fundamental period t. For the present example, this ratio 
is chosen as 



Fig. 13. The amplitude function in the Fourier analysis of the succession of 
rectangular pulses of Fig. 12. 


and since t = 2ir/w, this choice corresponds to a fundamental angular 
frequency of 


1 2ir 

5* 8 


[189] 


As shown in Fig. 13, the point tua = 2ir/5 marks the first zero of the 
dotted curve, and since the fundamental frequency corresponds to w = 1, 
this point is located one-fifth the distance from the origin to the point 
fua = 2t/8. The various ordinates for other integer values of n are then 
readily drawn. For the particular choice indicated in Eq. 188, the ampli- 
tudes of the fifth, tenth, fifteenth, • • • harmonics are zero because these 
points coincide with the zeros of the dotted curve. 

It is to be observed that the form of the dotted curve is the same 
regardless of the ratio 8/ t, and that only the spacing of the lines in the 
spectrum is dependent upon this value. Incidentally, the amplitude of 
the constant component, corresponding to « = 0, is seen to be the largest. 
All amplitudes are proportional to the duration 3, so that as the ratio 
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B/t is decreased, r remaining constant, the dotted curve remains intact 
except for a change in the scale factor for its ordinates. 

10. Power products and effective values 

In the discussion of problems in electric circuit theory, determining 
the average value of the product of two periodic functions is at times 
necessary. Since this product is again a periodic function with no larger 
fundamental period, it is sufiicient to restrict the evaluation to one period. 
If the two functions are denoted by /i(/) and /2(0> the problem calls 
for the evaluation of the integral 

/ = [190] 

This evaluation is particularly simple if the complex form is used for 
the Fourier series representation of the functions /i(/) and / 2 (/). These 
functions may be expressed as 

hit) = i [191] 

n — • 

and 

MO = ^ [192] 


The product is given by the double sum 

MO -MO = i. [193] 

n, fi» = — » 

Substituting this expression into the integral 190, and assuming that 
the series 191 and 192 are uniformly convergent so that the integration 
may be carried out term by term, one has 

/ = £ anffm [194] 


The integral involved here is a very simple one. Its evaluation reads 



^y(n-hn)«< 


gi(n+m)2ir _ | 

j{n + m)u> 


[2t . 

— form = — « 

\ ^ [195] 

[ 0 for m 9 ^ — w 


Consequently all the terms in the double sum 194 vanish except those 
for which m ^ —n. The function 7, however, still requires an infinite 
sum for its representation, that is. 


J — £ 


[196] 
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which is the desired result. It may be written in the alternative form 
J = Oo/3o + 2 iot„ 8 —n + Ot—n0n) [197] 

n —1 

or, if one writes, 

[198] 

the result 197 is found to be equivalent to 

J = ao^o + 2 2 [ofn^n! (^n “ <^n) 

n «*1 

Because the integral 19vS is zero for m 7 ^ the interesting result 
follows that the average value 190, given by 199, dei>cncls only upon the 
products of the amplitudes of harmonic components of like order. In 
other words, none of the cross-product terms resulting from the product 
of the two series for/i(/) and / 2 (/) contributes to the average value of 
this product. In a physical problem in which /i(/) represents a source 
voltage and/ 2 (/) the resulting source current, J represents the average 
power delivered by the source. The integral 190 is, therefore, also referred 
to as the pauper product of two periodic functions. The coefficient 
cos {pn “ 0n), which occurs in Eq. 199, is in electric circuit problems 
called the power factor corresponding to the harmonic of nth order. 

A very closely related problem is the evaluation of the integral 

-^' = ^jf"“[/(0?«f/ [200] 

which is referred to as the mean square value of the single periodic func- 
tion /(/). This is, however, merely the integral 190 for /i (/) = / 2 (/) = /(/), 
and hence the desired result is expressed by Eq. 196 or 199 for 
Hence, if /(/) is assumed to be given by Eq. 191, 

r = 00 ^ + 2 i. |a „|2 [ 201 ] 

n »1 

The square root of this value is called the root-mean-square value or 
elective value of the periodic function /(/). Recalling, according to Eqs. 
94 and 135, that the harmonic amplitudes Cn in Eq. 95 are given by twice 
the magnitudes of the complex coefficients one sees that the effective 
value of the periodic function f{t) is given by the expression 

[/«)]rfr«uv. = ’ [202] 

which contains the particular result that the effective or root-mean-square 
value of a sinusoidal function equals its amplitude divided by v^. 
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11. wSUMMATION FORMULAS 

It is useful to note that the Fourier series for various particular func- 
tions may be used to obtain in closed form the values for numerous 
special forms of infinite series. For example, from the function h{x) of 
Fig. 8 and its series 141, or from the function /(x) of Fig. 9 and its series 
172, one obtains by setting x = 7r/2 the result that 

i = + + 

Similarly, since the series 174 for the triangular function of Fig. 10 is 
still uniformly convergent for x = 0, one has 

The evaluation of many other infinite series may be obtained in this way. 



Fig. 14. A plot of the function of Eq. 205, the Fourier series of which is used in 
several summation formulas. 


.It is of greater practical value to recognize that similar means may be 
employed for the determination of more general summation formulas. 
For example, one may consider the function f(.\ ) defined by 


/(x) = sin M 0 — IT < X < 0) 

f(x) = sin A* 0 < X < t) 


[205] 


in which m may have any value. This function is shown plotted in Fig. 14 
for the value m The function is even and satisfies the condition 
98. Hence its Fourier series is represented by a cosine series with odd 
harmonics, and the coefficients are given by the formula 105. Substituting 
from Eq. 205, one has 

a„ = - fix) (^osnxdx = sin - i^cosnx dx [ 206 ] 
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The integration yields 


sin w.r sin ^ ““ 2 m 


cos nx cos Ml;; — 


r(n2 - M^) 


2m (1 — cos tlir) cos M 

t(;/“ — M“1 


I 4m cos m 2 

a„ = ^ -TTS — -1\ (for « odd) 

T{n — fjL ) 

0 (for « even) 

The Fourier series for this function is, therefore, given by 


4m cos M 




2 “ cos nx 

imt O 

n =1.3,5. ••• n — 11 


which converges uniformly for all values of Jt* since the function f{x) is 
continuous. Substituting from the relations 205, one may write for the 
inter\’al 0 < x < t 


0 -*) 


= L 


4m cos M 




Particularly, for x = 0, this result yields 

ftanM'= £ 73-^ [211] 

4m 2 n=i.3,.5,... (n^ — M y 

which is the partial fraction expansion of the tangent function, and is 
found to be a useful summation formula in connection with a number of 
problems in circuit theory. 

Other formulas may be developed from the relations 210 and 211. 
For example, subtracting the former from the latter, and using the 
trigonometric identity 


1 — cos nx = 2 sin^ n ■ 


[ 212 ] 
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one finds 

« sin^ n - 
n=ilT5,--- n{n^ — n^) 

which is a somewhat more general summation formula than Eq. 211. 

Now writing the Eq. 213 first for /z = and again for ^ — 112 , and 
subtracting the second from the first, give a more general formula which 
reads 





IT jsin M ^ - sin M ^2 ~ ^)} 


8 m cos m ^ 


J(n\,x) -f(ti- 2 ,x) 
Ml^ — M 2 " 


= z 

n = 1 . 3 ,. 5 , ' 


sin^ n - 

2 


(«2 - Ml^)(»^ - M2*) 


[214] 


I'he quantities ju, m, and ^12 in these formulas may have any real or 
complex values. 

Formulas having the form of Eqs. 213 or 214 are useful in electric 
circuit theory because the quantities (w“ ~ have the form of the 
resonance factors of an impedance function. Since the latter is a rational 
function, that is, a quotient of polynomials, it is always factorable in 
terms of the roots of these polynomials. Hence the impedance is ex- 
j)ressible as the quotient of products of factors of this type. The expression 
for the |x)wcr absorbed by an electrical network which is excited by a 
periodic source containing only odd harmonics (this case is most com- 
mon) assumes the fonn of the infinite sum in Eq. 214 when the network 
contains two degrees of freedom. 

For H 2 = 0, a special form of Eq. 214 results, reading 


s'n^ n \ 

8/(/i,.v) - xrc ^ ^ 

n=i, 3 . 5 .v-- n^{n^ — y?) 


[215] 


Other formulas may be obtained through differentiation of the preceding 
ones with respect to .v. .As long as the coefficients in the resulting series 
decrease for large values of n as rapidly as, or more rapidly than, the 
ratio 1/n^, then, according to Art. 8, the periodic function represented 
by the series is continuous, and the series still converges uniformly for 
all values of x inclusi\’e of the boundary values in the relations 205. 
However, if the coefficients in the resultant series decrease for large 
values of n only as fast as the ratio I/m, the series cannot be used for the 
boundary values of x, although it still converges uniformly for all other 
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.r-values. For example, dilTerentiating the formula 213 with respect to x 
gives 


T cos n 



4 


COS/i~ 


to 

Z 

n-l,3.5.-“ 


n sin nx 


[216] 


which is evidently no longer valid for =* 0 or x = ir. However, differ- 
entiating the formula 215 with respect to x yields 


TT COS/i 




COS/x^ 


4/x^ COS fjL ^ 


n 



sin nx 
m(«^ - M*) 


[217] 


which converges uniformly for all a:-values. 

The formula 214 may he generalized so as to contain any number of 
factors {rr — jik^) in the sum. Replacing mi and respectively by ^<2 
and /i 3 and subtracting the resulting formula from 214 yield 

^ 

(mi^ — M3*) (M2* — — M3^) (M3* — Ml*) (M3^ — M2^) 

. X 

. Sin- H - 

""n-lS.... („2 -mi^)(«^-M2")(»=*-M3=*) 

Again, replacing in this formula by Ms, mi respectively and 

subtracting the resulting formula from 218, one finds 

^ f{tik,x) 

*-l (m*^ — MiM-l^)(M*^ — Mit+2*)(w^ “ Mi+3*) 

. 2 a: 

. sin^ n - 

“n-lS.- («" - MI*)(«* - M2^)(«* - M3^)(n=* - M4*) 

in which the subscripts k = 1, 2, 3, 4 are assumed to form a cyclic group. 

Any desired further generalization is evidently possible. For example, 
by using Eq. 213, and forming the function af(jiijx) + hf{^i 2 ,x)j one 
obtains a formula which differs from Eq. 214 in that the terms in the sum 
contain a factor of the form {n^ — in the numerator in addition to 
factors of this form in the denominator. 
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12. The “ least squares ” approximation property of the 
Fourier series 

In this article the problem of finding a trigonometric series representa- 
tion for a given function / {x) is approached in a different manner. The 
trigonometric series which is to approximate the function /(a:) is assumed 
to be finite and is written in the complex form 

Snix) .-= i: [ 220 ] 

k = — n 

The following; question is now raised: What must be the values of the 
coefficients ak in order that the mean square error between fix) and 
Sn (x) may become a minimum? In analytic form the mean square error, 
expressed for the fundamental range, reads 

1 /'a-f2T 

£ = — / [/(.r) - [221] 

Zir 

The problem, therefore, is to determine the coefficients in Eq. 220 in 
such a way that the expression 221 becomes a minimum. 

Now 

[fix) - s„{x)Y = [f(x)y -h [s„(a:)]' - 2[/(x)5„(a:)] [222] 

According to Eq. 220, 

[Sn(*)]'*= Z [ 223 ] 

<,& = — n 

so that 

1 /»a4-2T 1 ” /»o+2ir 

^ [s„ix)]Ux = :^ T am [ 224 ] 

The integral appearing here has the same form as that in Eq. 195 of 
Art. 10. Hence 

^ r^‘’ [s„ix)Y dx = i. ana-t [ 225 ] 

2 Tf k = — n 

Next one needs to evaluate the integral 

T.sr ' Is *1 

The quantity enclosed by the curved brackets is, according to Eq. 122 or 
Eq. 156, recognized to be the expression for the complex Fourier coefficient 
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Ok, SO that 


It 



f(x)s„{x) 


dx 



ClkOt-k 


[227] 


Substituting Eq. 222 into Eq. 221, and taking note of the results stated 
in Eqs. 225 and 227, one obtains for the mean square error the expression 

E ^ ^ [f{x)]Ux + i a* 0 -t -2 f [228] 

TT •/<! k * — n A' => ~n 


Now it is obser\’ed that 

2 2 ^kOi—k = 23 + tf-w) [^29] 

Jfc = — n /; - -ri 

The last two terms in Eq. 228 may, therefore, be written in the form 

n n 

2 (aka^k — — <i-kOik -f cifca^k) — E otkot^k [230] 

k = — n k -- n 

which is equivalent to 

n 'I 

(Ok — ock){(i-^k ” “ 23 ocka^k L231] 

/; = — n A = — n 

or, since the coefficients with negative subscripts arc the conjugates of 
those with the same positive subscripts, these terms are given by 

S i [232] 

fc = — n k^ —n 

Hence the mean square error, according to Eq. 228, is seen to be given 
by 

E = ^ P'’[/(r)rdx+ Z i ki® [233] 

JLlC %Ja k^—n — n 

Inasmuch as the function j{x) and hence also the coefficients ak are fixed, 
the expression 233 evidently becomes a minimum for 

ak = oLk [234] 

whence one may conclude that the sum 5n(x) as given by Eq. 220 approxi- 
mates the stated function /(:r) over its fundamental range so as to make 
the mean square error a minimum ij the coefficients in the finite sum Sn(x) 
are the Fourier coejficients for the function f (x). 

When the coefficients a* are so determined, the mean square error 233 
becomes 
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which, according to Art. 10, is 


-E= Z Z kP [236] 

fc=— « *=— n 

Thus it is seen that the mean square error tends toward zero as n becomes 
larger and larger. In other words, the infinite Fourier series approximates 
the given function j{x) in such a way as to make the mean square error 
over the fundamental range vanishingly small. The Fourier series, or 
any of its partial sums, is, therefore, said to approximate a given function 
in the “ least squares ** sense. 

In view of this result it is easy to show why, at a point of discontinuity 
of the function /(x), the Fourier series always yields the arithmetic mean 
between the two values of /(x) immediately adjacent to the point of dis- 
continuity. Thus, if these two values are denoted by a and 6, and the 
value of the series at the point of discontinuity is denoted by 5 , the mean 
square error at this point is expressed by 

+ _ (, - 

If this is to be a minimum, its derivative with respect to s must be zero, 
that is, 

5 — d + 5 .— b = 0 [238] 

which yields 


13. The approximation property of the partial sums; the 
Gibbs phenomenon 

When, in a practical problem, a trigonometric series is used to approxi- 
mate a given function, this series must for obvious reasons be a fimte one. 
Consequently, it is of considerable interest to know more about the 
detailed manner in which the partial sums of a Fourier series approximate 
a prescribed function. 

A partial sum reads 

Sn{x) = z [240] 

I:*— n 

in which 
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Substituting this expression for a* into Eq. 240 and reversing the order of 
summation and integration yield 

^n(^) = f rm i [242] 

ZTt/0 JL=— n 


Now 


i ^ — j) -j- COS 2{x — J) + • • • -f cos n{x — f) 

[243] 


and, according to Eq. 11, this gives 


£ = 

fc a* n 


sin (2 h + 1 ) 


- £ 


sin(i^) 


[244] 


Substituting this result into Eq. 242, one finds that a partial sum may be 
expressed as 

Sn{x) = ^ jf 7«)« (f - x) [245] 

in which 


«({ - *) 


sin I (2« + 1) 



'■246] 


This function is shown plotted versus f in Fig. 15 for the particular 
choice n = 5. It consists of a periodic succession of identical large humps 
with the amplitude (2n + 1) and the fundamental period 27r, separated 
by a number of smaller oscillations which decrease in amplitude with 
increasing distance from the large humps, the smallest of these having 
unit amplitude and occurring midway between any two of the large 
humps. It is observed that only one large hump is encountered throughout 
the fundamental range 0 ^ a: ^ 2 t, and this occurs at the point { = :r. 

According to the relation 245, the partial sum Sn{x) is given by the 
product of this peculiar function w($ — x) with the given function /({) 
integrated over the fundamental range. The integration ’must be evalu- 
ated separately for each value of x for which the corresponding value of 
Sn{x) is desired. During each integration, x is treated as a constant. The 
function — x), which contains x as a parameter, remains the same 
for various jr-values except for a translation of this function as a whole. 
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'Fhe value of x merely determines the location of the large hump in the 
fundamental range. 

Figure 16 illustrates the function — x) for a fairly large value of n, 
and shows its position in the fundamental range relative to the given 
function /($) for some arbitrary x-value.* For the interpretation of 
the integration in Kq. 245 one may visualize the plane of Fig. 16 as a 
screen on which the function M(f — x) is plotted and of which the portion 
corresponding to the shaded area under this curv'e is cut out. The func- 
tion /({), which is plotted on a second screen located behind the first, is 



Fig. 15. 'Fho scanning function used in a study of the Gibbs phenomenon. 


assumed to be visil)le only through the portion of the first screen which 
is cut away. For a large value of ;i, this cutaway portion is essentially 
in the form of a long narrow slit corresponding to the area under the 
large hump in the function //($ — x). If one neglects for the moment the 
small areas under the minor oscillations or rij pies adjacent to this slit, 
and assumes further that this slit is sufficiently narrow so that the small 
rK)rtion of the function /($) visible through it may to a first approxima- 
tion be regarded as characterizing the single ordinate of this function 
for the point { = a*, it becomes clear that the value of the integral 245 
is given simply by the product of f{x) and the area of the slit multiplied 
by 1 2 t. If 1/2 t times the area of the slit equals unity, the value of the 
integral, to a good approximation, equals f{x). In other words, one arrives 
at the reasonable conclusion that Sn(x) ^f(x). 

*Since the ratio of the height of the large hump in the function u{^ x) to the width of 
the fundamental range is (2m + l)/2ir, it is not practically feasible to draw this figure, or 
Fig. 20, in correct proportion with regard to this ratio. These figures are, however, in correct 
projjortion with regard to the ratios of the amplitude of the large hump to those of the adjacent 
oscillations, *is well as with regard to the ratio of the width of the fundamental range to that 
of the large hump at its base. This ratio is equal to m + 
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The values of 5„(.r) for various values of .r are obtained through dis- 
placing the hrst screen in the horizontal direction so that the slit uncovers 
different portions of the curve /($) corresiX)nding to the point £ = x. 
This process of moving the screen with the slit corres{X)nding to //(£ — x) 
across the fundamental range, and viewing the function /(£) through it, 
is referred to as scanning /(£) with the function //(£ — x). The latter is 
called the scanning function. If this scanning function consists of a single 
rectangular hump of extremely small width but of such height that the 
area of the slit nevertheless is finite and equal to It, the integral 245 
yields almost exactly the function /(.v), the discrepancy becoming zero 



Fig. 16 . Pertinent to the evaluation of the partial sum of a Fourier series. 

as the slit width is allowed to approach zero. Actually, because of the 
deviation of the scanning function //(£ — x) from this ideal limiting 
form, the integral yields a function ^n(^) which is only approximately 
equal to fix). 

For a relatively small value of n (few terms of the Fourier series) the 
scanning function u{i — x) deviates quite considerably from its ideal 
form, and the partial sum 5„(.v), as one should exjxjct, only crudely 
approximates the given function f(x). As more terms are added to the 
partial sum, and n becomes relatively large, the form of the scanning 
function improves, and so does the degree of approximation between 
5n(^) and fix). The improvement in the scanning function is due to the 
large hump’s becoming taller and narrower, as is clear from Fig. IS, in 
which it is indicated that the height of the large hump equals {2n + 1) 
and its width at the base equals 4t/ {2n + 1). At the same time, the adja- 
cent ripples become larger in number and hence also become narrower, 
in the same proportion, in fact, as the large hump becomes narrower. 

One disturbing feature about the behavior of this scanning function 
with increasing «, however, is that the ripples immediately adjacent to 
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the large hump do not ultimately become smaller and smaller in amplitude 
relative to the height of the large hump but maintain amplitudes which 
asymptotically approach a finite ratio relative to the height of the large 
hump. The ultimate effect of this disturbing circumstance requires 
further careful investigation which is begun by study of the properties 
of the scanning function in greater detail. 

First it is significant to observ'e that the net area under the curve for 
%i(X — x) over the fundamental range is indeed equal to 27 r, and that this 
is true for any as may readily be seen in several ways. For example, 
according to Eqs. 24v3, 244, and 246, 

a:) = 1 + 2 cos ({-a:)+2 cos2(i— x)H \-2 cosn{^—x) [247] 

In evaluating the integral of this expression over the range 0 < f < 2 t 
it is obser\ ed that all the integrals for the cosine terms yield zero. Hence 

«({ -x)di = di = 2 t [ 248 ] 

This result may also be seen from the integral 245 through assuming as 
a special example that /({) = 1. Then Sn{x) must, of course, also equal 
unity; that is, the partial sum reduces to its constant term and this must 
have the value unity. The integral 245 then yields the result expressed 
by Eq. 248. 

Thus it is established that l/2ir times the area under the scanning 
function is equal to unity, as the preceding discussion indicates that it 
should be. However, this area is not confined to a single slit but is partly 
contributed to by smaller increments due to the ripples, which are alter- 
nately numerically positive and negative. A glance at the scanning 
functions shown in Figs, 15 and 16 reveals, moreover, that the net area 
contributed by the ripples is numerically negative so that the area under 
the large hump must exceed the value 2 t if tl ;e resultant area under the 
entire function is to have this value. The amount of the excess area 
under the large hump depends upon the value of «, but the important 
point is that this excess does not become zero as n is indefinitely increased. 

In order to comprehend this fact, one must study the behavior of the 
function w(f - x) for large values of n. To facilitate this study it is 
expedient to introduce in the form given by Eq. 246 the change of variable 

, = ( 2 « + 1 ) [ 249 ] 

Then the scanning function assumes the form 



[ 250 ] 
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The region of the main hump eorresix^iuis to — tt < t; < tt, and each 
adjacent smaller hump corresponds to an increment of tt in the variable rj. 
As indicated in Fig. 16, the region over which the adjacent ripples have 
signiticant amplitudes is conlineil to the more immediate vicinity of 
the large hump. In other words, the entire region of interest for the 
function //(t;) is confined to a finite interxal for the variable 7 ? which, for 
example, may be described as — IOtt < rj < IOtt, or something of this 
order. 

With regard to the exj)ression 250, if ;/ is sulliciently large so that 
approximately 

lOir TT 

>7+1 6 


little error is introduced in the determination of throughout the 
region of interest if tlie sine function in the denominator is replaced by 
its argument. In the limit n — ^ any error introduced by this procedure 


■n tf 



Fk;. 17. A plot of the function ilIL!? \vhich occurs manv times in Fourier analysis. 

V 

becomes vanishingly small. Hence, for large values of «, one is permitted 
to write for the scanning function 

«(^) = (2«+ l)(^) [252] 

The function (s\n ri)/rj has appeared before in these discussions (see 
Fig. 13 of Art. 9) and will appear again in a variety of problems. The 
reader may, therefore, well spx^nd some time becoming thoroughly 
familiar with its characteristics. It is plotted again in Fig. 17, w^hich 
illustrates some of its more immediately apparent properties. At the 
origin, t? = 0, the function has the value unity, and a zero derivative, 
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as is apparent if one writes the Maclaurin series for (sin r?)/!?, thus 
TJ ~ — 4- - - 

sinri ^ 31 51 , ri^ . 

= = 1 “ + • • • [2531 

V V 6 120 

from which it is also clear that the function is even. At the points 
T? = ±:2vy • • • the function passes through zero. Aside from the 

ma.ximum at ?? = 0, the function has further maxima and minima at 
the points where 

£(»;-’) = 0 P541 


tan 1} = V [255] 

The r(x)ts of this transcendental e(|uation are very nearly, and for suffi- 
ciently large r) almost exactly, equal to 

3w Sir It 


Actually they are slightly smaller than these values, but as a sketch of 
the relation 255 readily reveals, the discrepancies even for the first few 
roots in tlie set 256 are hardly noticeable in a graphical representation. 
'Fhe values of the.se ai)proximate maxima and minima are most con- 
veniently exj)ressed in terms of the value of the function for rj = ztir/2, 
which is 2 tt. The ai)proximate minimum at v = ±37r 2, which is a nega- 
tive value, is one third of 2 tt; the maximum at 77 = di57r 2 is one fifth 
of 2 t; and so forth. In other words, the maxima and minima in absolute 
value become smaller with increasing 77 like ti e ratio 1 77. For example, 
the fourth extremum ai It 2 has the value 2.1 t ^ 0.091, or only about 
9 per cent of the value of the function at 77 = 0. 

With regard to the area under the curv e of Fig. 17 it is significant that 
the function 

5i(x-)= [257] 

«/0 T7 


which represents the area under the curv'e for (sin r\)/r) from the origin 
to some variable point 77 = x is a familiar function in Fourier analysis 
and allied matters. A short discussion of some of its properties may well 
be given at this time. 

The shaded area in Fig. 18 represents the value of the function Si {x\ 
called the “ sine-integral of for a particular value of .v. The essential 
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properties of this function are easily obtained from inspection of this 
figure. 

First one may observe that for values of x near the origin, the area 
evidently increases very nearly, in fact for .v = 0 exactly, as .v. This 

circumstance may readily be seen by 
observing that the function (sin rj) rj 
has a zero derivative (the curve is 
horizontal) at the origin. Hence 



~d Si (.v)" _ 

(Ix Jx -0 


1 


[258] 


As the curce for (sin yj) rj begins 
to fall, however, the rate at which 
Si (.v) inc reases with x falls olT, and 
it eventually becomes zero at .v ~ tt. 
Here the function Si (.v) reaches its lirst maximum. From .v = tt to 
.V = Itt the 5/-fimction dec reases from its first maximum by the amount 
of the negative area enclc.^ed by this jHirtion of the (sin t?) t? curve. By 


Si(x) 



the same sort of reasoning, it is readily seen that the function Si {x) 
has the general character shown in Fig. 19. The alternately |X)sitive and 
negative areas contributed by the ripples in the function (sin r])/rj become 
smaller and smaller with increasing .r values, and the net area, therefore, 
approaches a finite value asymptotically and in an oscillatory manner. 

The as>Tnptotic value is reaclily obtained from the result stated by 
Eq. 248, namely, that the net area under the curve for the scanning 
function is equal to lir for any n and any x. This equation may be written 

J^M(f - x)d[i - x) = 2r 


[259] 
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or, with the use of Eqs. 249 and 250, one has 



2 ^(2r.+i)x/2 sin?; ^ 

2n 4“ 1 1 /— ( 2 n+i)» 2 , / V \ ^ 

sin 1 T 1 

\2« + 1/ 

[260J 

In the limit ;/ 

— ► » this result reads 



2 f' ~dv = 2tr 
«/— * rj 

[261] 

and since the function (sin 7])/rj is even, it follows that 



«/0 T) / 

[?62] 

which is the 

asymptotic value of the function Si (a:), 

as indicated in 


Fi-. 19. 

With regard to the evaluation of the integral 245 for large values of w, 
according to the scanning process illustrated in Fig. 16, several points of 
significance may now he brought to the reader s attention. First it is to 
be obser\*ed from Fig. 17, which according to Eq. 252 represents the 
apiK'arance of the scanning function for large values of that the smaller 
negative humj)s immediately adjacent to the large hump have amplitudes 
which remain slightly greater than one-fifth the amplitude of the large 
hump no matter how large n becomes. As already pointed out, the 
scanning function, therefore, does not approach its ideal form as n 
increases without limit, since the net area under the scanning function 
does not become equiil to the area under the large hump. The latter 
area is, for large values of //, equal to twice the area under the cur\’e of 
Fig. 1 7 between t; = — tt and rj = tt. ('Fhe factor of tu'o enters here because 
of the change of variable given by Eq. 249, is also seen in the steps 
leading to Eq. 261.) This area is equal to 4 5 (tt), which is found (from 
tables of ^/-functions) to be about 18 per cent larger than 27r. 

In the limit // — ♦ x , the scanning function may be visualized as result- 
ing when all the side oscillations in Fig. 16, together with the large hump, 
are compressed so as to form a single ordinate of infinite height. The ex- 
cess area under the large hump then virtually becomes coincident with the 
residual areas of the ripples, and since the net area always equals lir, the 
ultimate form of the scanning function may in a sense be said to have met 
the ideal requirements. Vet for any finite however large, there exists a 
departure from the ideal which has a definite effect upon the approxima- 
tion projxjrty of the partial sum Sn(x)- This departure becomes particu- 
larly marked in examples in which the given function /(.r) possesses dis- 
continuities. 
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In order to illustrate this fact a function /(.v) is assumed to be zero 
from a* = 0 to .v = .Vi, and equal to unity over the remainder of the 
fundamental range from x = xi to .v = Itt. ilie integral 245 then becomes 

Snix) = ^ r’u{^-x)d^ [263] 

ZT •/ti 

This expression may be replaced by the two integrals 

Sn{x) = ^ jT u{^ - x) dt - ^ jf ' u{^ - x) d% [264] 



Fig. 20. The scam.'ng function for n = 28 applied to a rectangular pulse. 


These manipulations are clarified by reference to Fig. 20, in which the 
scanning function for « = 28 as well as the particular function /({) con- 
sidered in this example is plotted. Thus it becomes clear that the first 
integral represents 1/2t times the area under half the scanning function 
and, therefore, has the value |. By use of Eqs. 249 and 252 in the second 
integral, the relation 264 becomes 


s»(*) = 


I 


-f 


(2i,+l)(»|-*)/2sin77 


2 xt 

which, according to Eq. 257 is equivalent to 


V 


dri 


^n{x) = - + - Si [(n + — :ci)] 


[265] 


[266] 


This expression shows how the discontinuity of the function f{x) at 
is represented by the partial sum The second discontinuity 
a,t X = 2 t is represented in an exactly similar manner, with the result 
that the partial sum for this periodic function /(x) and the choice w = 28 
assumes the form shown in Fig. 21. 

It must be admitted, of course, that a discontinuity in the function /(:r) 
offers an extreme test of the ability of the partial sum s„(x) to approxi- 
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mate this function. In other words, for a continuous function /(^r), the 
partial sum Sn{x) with the same number of terms yields a vastly better 
degree of approximation. Nevertheless, it is practically significant to 
investigate the approximation property of the partial sum in the most 
adverse circumstances. In this respect one observ^es from the appearance 
of Fig. 21 that residual discrepancies remain even for very large values 
of n. As the latter is further increased, this figure is changed only in that 
the ripples in the vicinity of the discontinuities of f{x) show a propor- 
tionately increased rate of oscillation versus the variable x, uhereas their 
amplitudes relative to the magnitude of the discontinuity remain the same. 



Fig. 21. The result of the operation illustrated in Fig. 20. 

In the limit « — > « , these ripples are compressed into a single vertical 
line at the point of discontinuity, but even in this limit the Fourier series 
is still observ’ed to yield the overswing of 18 per cent* which is charac- 
teristic of the function Si (.v). It is true, of course, that in the limit 
this overswing together with the adjacent ripples occupies zero space in 
the fundamental range, so that practically speaking one may say that no 
residual discrepancy between the series and the function remains. Never- 
theless, the phenomenon is noteworthy from a mathematical standpoint 
inasmuch as it illustrates the ultimate effect of the failure of the scanning 
function properly to approach its ideal form in the limit « — > oo . It is, 
moreover, of practical concern also since it reveals the disheartening fact 
that, by means of a finite jx)rtion of a Fourier series, a given function can 
never be approximated in the vicinity of a discontinuity wdth a tolerance 
less than the characteristic 18 per cent overswing, no matter how many 
terms one may be willing to use. 

This peculiarity of the Fourier series, referred to in mathematical terms 
as the Gibbs phenomenon, is a very real disadvantage with regard to 
certain practical problems. Because of this phenomenon, it is necessary 
in connection with certain approximation problems to use other types of 
trigonometric series which are appropriate modifications of the Fourier 

•This percentage b based upon half the value of the discontinuity. In terms of the whole 
discontinuity the overswng is 9 per cent. 
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series. An imi.x)rtant modilication of this sort is discussed in the next 
article. 

14. Approximations by means of I'ejer polynomials 

The appearance of the Gibbs phenomenon in the Fourier series is evi- 
dence of the failure of the uniform convergence of that series in the 
vicinity of a discontinuity of the given function. In other words, the 
partial sums 5,i(.v) no longer converge toward a definite limit with increas- 
ing ;/ for the immediate vicinity of a point of discontinuity of/(.r). It is 
the object of the present article to show that this failure in the uniform 
convergence of the series, and the associated Ciibbs phenomenon, may be 
removed if the sequence of the partial sums is replaced by the 

arithmetic moan sequence defined by Eq. 146 of Art. 9, Ch. VI. 

This sequence reads 

5o(a 0 = 5o(.r) 

o / N ) -f 5i (.r) 

— 

^ ^ , 5n(.v) -h si(x) + 

So^x) = ; [2671 


L Sn(x) 

Substituting from Eq. 240, and utilizing the notation 

= [268] 
one recognizes that this sequence may alternatively be expressed as 
So{x) = ao = y 
Si(x) = ao + 5C1 cos (x + <tn) 

Siix) = ao + |ci cos (a; + <^.i) + cos {2x + 4>i) [269] 


S,{x) = «o + Cl cos {x + <i>i) + — C 2 cos {2x + ^ 2 ) 

+ • • • + b, cos (w + <i>,) 


V 
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The coefficients ak are the Fourier coefficients defined by Eq. 241, and 
hence the partial sums appearing in Eq. 267 are, according to Eqs. 245 
and 246, given by 


" Ir So 


sin I (2n + 1 ) 


. U--v| 


■ [270] 


For the following manipulations it is now useful to obser\'e that 
sin I (2« + 1 ) ’’--r— 


sm I - 




sin I (2n + I ) | sin 

sin- ('-^) 

cos ;/(J — .v) — cos (;/ 4- 1)(J — x)] 


2 sin 


in' (4^) 


By use of Eq. 270 in the last of the relations 267, it is found that 
1 r*' s Jl, cos ;/({ — .v) — cos [(n + l)(f — x)] 


Sr(x) = - / m L ■ 

Zt «/() n =0 


2{v + 


1) sin' ('-y:) 


[271] 

[272] 


But 


2 (cos«({— cos [(« + l)(J-a-)]| =1— cos (|-a)+ cos ($— a) 

n =0 

— cos 2(J — x) + cos 2(1 — a;) — • • • — cos {v + l)(t — x) 

? - .T 


= 1 — cos (j' + 1)(J — a:) = 2 sin^ | (j' + 1) ' 
Hence Eq. 272 becomes 

S,{x) = -di 

in which 

Ln^-lV 


I’($ - x) = 


X + 1 


iin I 


sin j (v + 1) 


V 


sin 




[273] 

[274] 

[275] 


/ 


The expression 274 for the trigonometric polynomial S,{x) is identical 
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with the form 245 for the partial sum excei>t that the scanning 

function now is — -v\ as given by Kq. 275, instead of //(^ — x) as 
given by Eq. 246. Comparing the function //($ — x) with — .v) shows 
that the latter is essentially the square of tlie former. With v = 2;/, this 
simple relationship between u[^ — .v) and — .v) is true except for the 
factor 1 (r 1) in the expression for — .r). But for this scale factor, 
the plot of Eq. 275 for v — 10 is simply given by the square of the function 
— x) plotted in Fig. 15 for n = 5. The result is illustrated in Fig. 22. 



Fig. 22. A plot of the scanning function th.U results when Cesaro summation is 
applied to a Fourier series. 


A comparison of Figs. 15 and 22 clearly reveals the superiority of the 
scanning function cmJ — .v) over the function //({ — x). Most significant 
in this respect is that the present function has no negative values, from 
which it follows that, at a point of discontinuity of the function fix), the 
partial sum SJx) cannot overswing. Moreover, the ripples adjacent to 
the large hump in the function v(^ — .v) have considerably smaller 
amplitudes than those for the function //fj — .v). 

The net area under the scanning function vi^ — .r) for the fundamental 
range is again equal to 2x, as may most easily be seen through considering 
the integral 274 for the special case /({) = 1, whence Sy{x) must also 
equal unity. Since the function ?({ — x) has the period 2x, it is thus 
established that 


1 

2w{v + 1) 



di ='l 


[276] 


for any x and any v. 
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With the change of variable 


ri = {v A- \) 


f - ^ 


the function 275 becomes 


I V 


V 1 


1 Sin I 

V ^ 


[277] 

[278] 


For large values of v the scanning function is, therefore, ver\' nearly 
given by 

'.■(^) = (^+ [279] 

The integral 276 then reads 

and for > x one has 


From Eq. 279 it is found for large ^-values that the area under the 
large hump in the scanning function i’(J — .v) is equal to approximately 
90 per cent of the total area of 27r, and that the combined area under the 
two small humps on either side is about 5 per cent of the total area. 
The amplitude of the small humps immediately adjacent to the large one 
is about 4.5 per cent of the large amplitude. These ratios hold for reason- 
ably large finite values of v (about 10 or more^ and do not vary as v is 
increased without limit. 

With this scanning function, the function /(^) of Fig. 20 yields a 
partial sum according to Eq. 269, which has the form shown in 

Fig. 23. In order that this figure may correspond to the same width of the 
large hump and of the adjacent ripples relative to the fundamental range 
as for the scanning function of Fig. 20, it is drawn for v = 56, which is 
double the ;z-value chosen for Fig. 20. In other words, it is necessary'” to 
have twice as many terms in the partial sum 269 as in the partial sum for 
the Fourier series in order to obtain the same slope at the points of dis- 
continuity of the function f{x). However, the Gibbs phenomenon is now 
completely suppressed. Instead, the increment in Sv{x) near the point Xi 
equals only about 90 -per cent of the value of the discontinuity, corre- 
sponding to a displacement of the scanning function — x) by an 
amount equal to the width at the base of its large hump. A succeeding 
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displacement equal l(^ the width of the next adjacent small hump yields a 
further increase in S,a') of about 5 per cent, and so on. 

The essential difference between the behavior of the partial sum 
Sy{x) and that of 5 n(*v) in the vicinity of a discontinuity of the function 

Sy(x) 




Fig. 23. The result of scanning a rectangular pulse with the scanning fuiu tion of 

Fig. 22. 

f(x) may be seen from a comparison of the function Si (.v), defined by 
Eq. 257 and illustrated in Fig. 19, with the function 

Q(x) = £ dv [282] 

The latter is shown plotted in Fig. 24, in which the dotted (urve rep- 
resenting Si (x) is also drawn in order to facilitate the comparison of 
these two functions. They both have the same asymptote, but Si (x) 
converges toward this value in an oscillatory manner, whereas the 
function Q(.v) approaches it monotonically. 

The conclusion is that the so-called Fejer sum S^.(x), given by Eq. 269, 
converges toward a definite value as v is indefinitely increased, even at 
points where the given function /(.i) is discontinuous. In other words, the 
Fej6r series, which is given by Sy(x) for — > x , converges uniformly over 
the entire fundamental range even when the function f{x) which it 
represents possesses discontinuities. 

It is interesting to obseiv^e, according to the relations 269, that for 
large values of v the coefficients in the sum 5 v(a') differ appreciably from 
those in the sum Sn(x) only for the higher harmonics. In other words, the 
coefficients of the initial terms are practically the same in the Fejdr sum 
as they are in the partial sum of the corresponding Fourier series when 
the total number of terms is large. This fact means that as the number 
of terms becomes infinite, the coefficients for any finite number of terms 
are actually identical. The two resulting infinite series differ only in their 
terms of infinite order.* 

*This is admittedly a rather loose way of referring to the higher order terms in a partial 
sum as the number of terms in that sum is allowed to increase without limit. 
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From a practical point of view this circumstance may at first sight 
seem trivial, inasmuch as the terms of infinite order can never be reached 
in any term-by-term calculation. Nevertheless, there remains a significant 
difference in the behaviors of the two series, in that one of them exhibits 



Fig. 24. F^sscntial (lilTcrcncc in the behavior of the partial sum of a Fourier series 
and that of a series of Fcjer polynomials in the vicinity of a discontinuity. 


the Gibbs phenomenon at any point of discontinuity and the other does 
not. This dilTerence is due precisely to the difference in the terms of 
infinite order because these are the ones which alone are significant in 
determining the approximation properties of the series in the vicinity 
of a discontinuity. 

15. Fourier analysis by graphical means 

In practice it frequently occurs that the function j{x) for which a 
Fourier series representation is wanted is available in graphical form 
only. Usually also in problems of this sort only a partial sum s^ix) is 
sought which approximates the function /(.v, with a certain stated 
tolerance. Before discussing a possible method of solution, it may be well 
to point out that the precise requirements of the desired solution in a 
problem of this kind are frequently not clearly stated, and hence that 
considerable confusion regarding the value of a particular solution may 
result. 

The problem is frequently put ver>" roughly in the statement that 
a finite trigonometric fxjlynomial of the form of Sn{x) is sought which 
approximates a given function, but usually nothing is said about the 
approximation properties of the desired polynomial except perhaps that 
the ‘‘ best approximation ” which a given number of terms can yield is 
wanted. Inasmuch as there are an infinite variety of ways in which a 
trigonometric polynomial with a given number of terms may approximate 
the required function f{x), the solution is decidedly not unique. For 
example, the polynomial may approximate f{x) so as to make the mean 
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and odd components /i (.v) and f 2 (x) respectively according to the rela- 
tions 80 and 81. d'he Fourier series lor these components read 

f\ {X ) = (7o ~h t /2 “h ^ * [283] 

J^'kx) = bi sin .V -f b '2 ^i^^ 2.v -f sin 3x -[-••• [284] 

and the corresponding partial sums are written 

= (7q -j- ill COS A* "!“•*•“}“ <7,i_i COS {ft — 1)a? [285] 

s^“\{x) = bi sin A* -f b-j sin 2a* -f • • • + b^ sin nx [286] 

The problem is to determine the coefficients in these partial sums so that 

~/i(A*) [287] 

s^'-\ax) ^f.ix) [288] 

According to Art. 2, the range 0 < a* < tt is divided into n subranges, the 
centers of which corres|X)nd to the A-values 0 < .Vj < a *2 < • • • < a*„ < w 
with 

»■* - - [2891 

With the coefficients 

cck8 = cos 5A*/; and ^3f:s = sin sx^ [290] 

as defined by Eqs. 52 and 53, the problem reduces to a determination 
of the solution of the systems of linear algebraic e(|uations 

n— 1 

ocksds = fl{Xk) {k = 1 , 2 , • • • 7l) [ 291 ] 

8=0 


Z /3ifc.6<, = fziXi) {k = 1, 2, • • • «) [292] 

8=1 

Multiplying these equations respectively by akr and ^kr and summing 
over k yields 


n— 1 / n \ n 

( 2^ OLkT^Cks ) E Ukrf 1 ( 

«=oV=i / ^•=l 


21 ( 0kr0k^ bs — ^ Pkrf2{Xk) 


In view of the orthogonality conditions stated by Eqs. 56 and 57, one 



Art. /5\ 


FOURIER ANALYSIS BY GRAPHICAL MEANS 


505 


linds 


and 


[n 

r a*r/l(.V,) = -j 2 
\na^ 

(n b 

i: = ’2 ^ 

A- - 1 j 

[nbn 


for r ^ 1 , 2, • • • — 1 

for r = 0 

for r = 1, 2, • • • » — 1 
for r = n 


Observin^^ that 


Of:n — si?l HXk = sin [k — }j)7r = ( — 1 V' ^ 


and that 


= 1 

Eq. 295 gives 

2 '' 

= - Z (r = 1, 2, • • • - 1) 

« A- - 1 

and in particular 

1 " 

whereas I^q. 296 yields 

= “ Z f^krJ^iA'k) ( 7 = 1, 2, •••;/- 1) 


with 

in = ‘ i 

« A' = 1 


[295] 

[296] 

[297] 

[298] 

[299] 

[300] 

[301] 

[302] 


The last four equations constitute the desired solution. The values 
of /i (.va) and foi-^k) ior k = 1, 2, • • • n are taken from the graphical plots 
for these functions, and the values of the coefficients a^r and ^kr are either 
calculated from the relations 290 or read from graphical plots of the sine 
and cosine functions, it being noted that the arguments of these functions 
are {2k — l)sTr/2ft, Thus all the values of the coeflicients akr and ^kr are 
readily taken from a pair of carefully plotted curves for the sine and cosine 
functions over a 90-degree interval. In this way the solutions given by 
Eqs. 299 to 302 are quite rapidly evaluated for reasonably large values 
of n. 
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It should be observed that the values of the eoetheients Hr and hr thus 
obtained, when substituted into the partial sums 2S5 and 286, yield tunc- 
tions which agree respectively with /i (.v) iind f y 'yx) at the points .Vi • • • .v„ 
and, of course, also at the points which correspond to the negatives of 
these .v-values. Over the small ranges between these points the values of 
the partial sums do not agree with the corresponding ones of the functions 
f\[x) and but if ;/ is chosen sufficiently large it is reasonable to 

expect a fairly good aj’iproximation between the partial sums and the 
functions which they are to rejuesent. 

If the functions /lU*'! and are relatively smooth, the points 

.'*1 • • • .v„ may be spaced farther apart; if the given functions are very 
irregular, the spacing of these j'loint.s must be smaller. At all events the 
spacing must be sufficiently (lose t(^ take acc'otinl of the most rapidly 
varying portions of the given functhms; otherwise such variations cannot 
be expected to be e\'en approximately reproduced by the resulting partial 
sums. 

Of the nature of the resulting approximation not much can be said 
inasmuch as this depends largely upon the particular characteristics of 
the given function and the chosen s})acing of the jxiints .Vi • • • .v„. In order 
to conser\'e computational labor, this spacing is ordinarily chosen as large 
as possible, in which case the coefficients in the vicinity of the terms of 
highest order cannot be expected to be even approximately eciiial to the 
true Fourier coefficients. However, in view of the discussion given earlier 
in this article, it does not necessarily follow that these coelhcients are 
practically of less value than the others. 

In various practical ])roblems, methods of detennining finite trigono- 
metric polynomials wliich exhibit controllable apjiroximation properties 
would be highly desirable. Of particular interest in this resj)ect would be 
a method of determining a polynomial which a])j)r(jximates the given 
function with a uniform tolerance, since this type yields the closest 
approximation for a given finite number of tenns. Such methods are 
desirable not only for use with gra})hically given functions but also for 
analytically given ones as well. Unfortunately, the.se questions have as 
yet received little attention. 

16 . Relation to the Bessel functions; Somaierfeld's 

INTEGRAL 

The problem of determining the spectra of frequency or phasc>modu- 
lated sinusoidal time functions, which occurs in the consideration of 
communication signals for which this type of modulation is used, reduces 
to the determination of the Fourier scries representation for the functions 
cos (p sin x) and sin (p sin x), in which p is a parameter. Since these are 
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the real and imaginary parts of the function 

f{x) = = cos (p sin x) -f- j sin (p sin x) [303] 

one may obtain both the desired Fourier series by considering this com- 
plex function alone. Incidentally, it is to be observed that this is a complex 
function of the real variable x and not a function of a complex variable. 
Xo extension of the Fourier series to functions of a complex variable is 
involved here, although such an extension is possible. 

The Fourier series for the function v?03 is advantageously written in 
the complex form 

3C 

/(.vj = 2^ ^304] 

in which 



'I'his integral is a special fonn of the more general complex integral 

Zp (p) = ^ J* e'" 'V"'^ L-506] 

which Sommerfeld has shown* to be capai)le of representing cylinder 
functions of all kinds according to the specitk choice made for the path 
of integration in the complex ('-plane. 

For the present problem it is sul'iicient to observe that the form which 
the integral 306 takes for the representation of cylinder functions of the 
first kind (Bessel functions), with the order p equal to an integer is 
(except for a factor identical with the intecial 305. This function is 
usually denoted by Jn{p)j thus, 



44ie restriction, that this integral representation for the Bessel function 
is valid only for integer values of is not violated in the problem of 
Fourier expansion considered here. 

Numerous variations in the form of the integral 307 are possible, a few 
of which may be worth mentioning. For example, making the change of 
variable x — > .r + tt/I and noting that the limits on the integral may, 
because of the periodicity of the integrand, be changed arbitrarily as long 

*Maih. Ann., 47 (1806), 385. Also in Riemann-Weber, Diffcrentialglckhnngen der Physik. 
Vol. II, p. 454 (ViewcR. 1927). 
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most engineering purposes by the expressions 



[322] 


[323] 


for values of .v larger than about 4. In eoniu‘Ction with the fre(|ueney or 
phase modulation ])roblem the arguments of the Bessel functions are for 



Fig. 26. A plot showing the close*ness of approximation to the Bessel function of 
order one by the function uf Kq. 32.^. 


the most part at least as large as this value or much larger, so that it is 
seldom necessary to consult the tables or curves for these functions. 
The functions of higher order are readil\ calculated in terms of 7o(*v) 
and J\{x) from the recursion formula* 

= -- Jn{x) - Jn-dx) . [324] 

X 


*The derivation and discussion of this formula nuiy be found in various well-known b(X)ks 
on the subject of Bessel functions. The approximate expressions 322 and 323 arc derived 
in Art. 26. 
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17. Fourier series ix terms oe more than one variable 


The Fourier scries representation of a periodic function may readOy 
be extended to functions of more variables. For example, if f{x,y) is 
periodic in both the variables .r and y with the fundamental ranges 
0 ^ .V ^ It and 0 ^ y ^ lir, and if, throughout the rectangular region 
thus delined, the derivative 


is finite and continuous, the series 






[326] 


is absolutely and unifomdy conxergent throughout the stated region 
and there represents the function /(.v,y) in the Fourier sense. 

As in the case of the Fourier series for a function of a single variable, 
the derivative 325 or the function /(.v,y) need not be continuous at all 
points of the region. The representation 326 is still possible as long as 
the Dirichlet conditions (see latter part of Art. 7) are satisfied in both 
the variables, but the series i.*> then no longer uniformly convergent 
throughout the entire fundamental region. 

Idle coefficients in the complex series 326 are given by the formula 


--’hi' ‘‘■''rS 




[327] 


which is a straightforward extension of the fonnula applying to functions 
of a single variable. Derivation of it may be assumed to proceed by con- 
sidering the function /(.v,y), for the moment, for a particular value of y. 
That is, for y = constant, /(.v,y) is a function o .v only, and a Fourier 
scries representation is possible in the form 


/(.v,y) = 

in which the coefficients 

••CO’) = f dx 


[328] 


[329] 


are functions of the parameter y. 

For any integer value of g, ^Im(v) may be represented by a Fourier 
series in the variable y, thus 

A,{y) = £ 


[330] 
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with cocfticients 

= f r A, (yV-=-^dy [331] 

Itt t/u 

Substitutmj][ Eq. v^v30 into Eq. 328 yields the double sum 326, which is 
the desired expansion for /(x,y); and substituting Eq. 329 into Eq. v331 
yields the formula 327. for the resulting Fourier coellicients. 

Thus the Fourier expansion may readily be extended to functions of 
any number of variables. 

18. Frequency groups 

In the present discussion a finite group of simple harmonic time func- 
tions (frequency com|X)ncnts) is assumed to be given, and an inquiry is 
directed toward determining the nature of the func tion defined by linear 
superposition of them. Although this problem has much in common 
with the consideration of the properties of tlie partial Fourier and F'ejdr 
sums discussed in Arts. 13 and 14, the present interpretations are directed 
toward an entirely different goal, namely, toward a means for represent- 
ing, in a closed fonn, functions which are not necessarily periodic yet 
for which the def ning range extends over the entire region of the inde- 
pendent variable from minus to plus infinity. 

To begin with a simj)le example, it is assumed that three simple 
harmonic functions with any finite amplitudes are given, having the 
frequencies 100, 125, and 150 cycles per second. The relative phase 
angles are for the moment immaterial. 4'he resultant function given by 
the sum of these three components is periodic, its fundamental frequency 
being 25 cycles per second. This conclusion is clear from the fact that 
25 is the highest common factor (H('F) of the group of numbers 100, 
125, and 150. The fundamental period is one-twenty-fifth of a second. 
Throughout this inter\'al the 100 cycles per second comjxinent completes 
4 cycles, the 125 cycles per second component completes 5 cycles, and 
the 150 cycles per second component completes 6 cycles. The original 
state of affairs is then re-established because each component has com- 
pleted a whole number of cycles. This statement is true no matter what 
the relative phase angles of the components may be. 

In the language of the Fourier series, the resultant function is repre- 
sented by its fourth, fifth, and sixth harmonic components alone. All 
other components, including the fundamental, are absent. It becomes 
clear that the linear superposition of a group of simple harmonic com- 
ponents yields a periodic function only if their frequencies have a com- 
mon measure. This means that the frequencies must be given by rational 
numbers or be rational multiples of the same irrational or transcendental 
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number. If several different irrational numbers (like v^2) or trans- 
cendental numbers (like tt or the naperian base e) are contained in the 
tjjroup of frequencies, the resultant function nev^r exactly repeats its 
sequence of values; that is, its period is infinite. The same is true if 
rational, irrational, and transcendental numbers in any combination 
are contained in the K^oup of frequencies. 

For example, the function 

/(/) - cos 100/ + cos (100 + tt)/ [332] 

never repeats its secjuence of values. This situation should not be con- 
fused with the well-known circumstance that the function given by 
J*]q. 332 can be inteq)reted as a beat phenomenon through conversion 
of the right-hand side of this equation to the form 

/(/) = 2 cos ^ • cos ^100 + ^ t [333] 

which is customarily j)lottefl through considering the slowly varying 
function 2 cos ( tt 2 )/ as an envelope containing the rapidly varying simple 
harmonic function cos (100 -}- 7r/2)/. Each half period of the function 
cos ( TT 2 )/ is commonly referred to as the “ beat period,” but this is a true 
period only if the ditTerence between the two frequencies involved in the 
expression 332 is at the same time their highest common factor. 

Even when the two frecpiencies are rational numbers the beat period 
is not necessarily a true jieriod. For example, the frequencies 100 cycles 
|XT second and 103 cycles per second give rise to a beat frequency of 
3 cycles per second, but the true fundamental frequency is 1 cycle per 
second. In other words, the exact pattern of the resultant function does 
not repeat until three beat periods have elapsed. The well-knowTi experi- 
mental fact that the human car (under proper circumstances) appears to 
recognize the beat frequency as though it wer actually present in the 
form of a separate component is a physiological phenomenon due in 
j)art to a nonlinear characteristic in the response mechanism of the ear, 
and this is not to be confused with the strictly linear superposition of 
com|X)nent frequencies considered here. 

It is interesting as well as instructive to generalize the problem of the 
simple beat phenomenon by inquiring how the interference pattern 
looks when more than two frequency components are superimposed. 
Incidentally, one must recognize that the familiar beat pattern in the 
case of two frequencies is pronounced only when the increment between 
these frequencies is small compared with either one. When the two 
frequencies and their difference are of the same order of magnitude, the 
conversion from the form of Eq. 332 to that of Eq. 333 is, of course, 
still valid, but the two functions whose product is represented by Eq. 
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333 then vary at about the same rate and tlie l)eat eluiracter of the result- 
ant function is lost. 

In generalizing the problem of the beat phenomenon it is, therefore, 
essential to assume a group of simple harnKuhc c(unj)onents whose si)acing 
in the frequency spectrum is small compared with the meaii frecjuency 

of the group, d'he line spei trum of 
such a grou]) containing .seven com- 
ponents i.s illustrated in Fig. 27. 'I'he 
mean angular fre(|uency is a',,, d'he 
adjacent freijiiencies are a',, oco, a*,, 
— olc, u,’,i 2occ, cl'o — 2(5cc*. ■ * ■ and 
forth, their uniform spacing 
being e(jual to ooj. 

In La-ncral. a fri‘(}Ut“ncy groiq) of 
this sort is considered to consist of 
}i components, and for simjdicity all the anlplitu(le^ are assumecl to be 
equal and all llie phase angle.s /t‘ro. It i.s further e.xpedient to .set the 
common amplitude equal to 1 'The width of the i> delined as 

Au' -- no^c .vvF 


-^Scc'r*- 



" 1 

r ' 

1 

1 

1 




""1 


Li. ; . 

: 

r* A(js H 



Fig. 27. The line spec t rum of a fre- 
quency group. 


The analytic exp;ession for the grouj) then reads 

cos a'l,/ — (.()> •'tCo •- ■■ u)> -UN, -|- 2oa; )/ j 


f(t) = 


( - 1 \ 

I U.'|| ^ I 


1 


-f- cos ict'd — b(j))t ^ cos ■ cCd — lbo))t 


i 


( " - n \ 

)S I a*d — ^ Ow I I 


By repeated use of the trigonometric identity 

cos {a Az b) = cos a cos h ^ sin a sin b 
this expression may be put into the form 


(T^5! 


[,F^6] 


... 2coscoo^ fl ^ n — \ 1 

/(O — * 1 o 5(Ajt-{-cos 2ba)l-{~ * * * Tcos — - — Oce/ |- 

1 2 2 J 


With the formula expressed by Eq. 11 this result may be written 

. uboj 
sin 

/(/) = • COS Wq/ 

. 5(jJ 
n Sin — I 


[.« 8 ] 
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Here the function 




. nhoi 

. bco 
n sm— i 


[339] 


is slowly variable, compared to the mean frequency component cos coq/, 
and may he reKarde{i as an envelope function enclosing this mean fre- 
(juency. The beat phenomenon is placed in evidence by the envelope 
function just as it is in the simple case of two interfering components. 



'rhis envelope fum tion is {dotted in Ing. 28 versus the time t for the 
t'ase n 7. 'The interference |>attern for the grouj) of frequencies whose 
line s{K‘( irum is given in Fig. 27 is lluw illustrated by Fig. 28. The regions 
of constriu tive interference, which lie in the vicinities of the main humps 
of the envelo{)e, are s|)aced at intervals of 

Itt 

T„ — - seconds [3*^0] 

'Fhe duration of each main hump is Att Aw seconds. The group period 
or beat {)eriod r., is, ai cording to Fap 340, inversely proportional to the 
frecjiiency increment 5w, and the duration of a region of constructive 
interference is inversely proi>ortional to the width Aw of the group. If 
this width is kept constant while more components are added to the 
group, the regions of constructive interference occur at longer interv^als 
but their duration remains the same. The number of smaller humps be- 
tween the large ones also increases; but, for a large number of component 
frefiuencies, the amplitudes of these smaller humps become insignilicant 
midway between the large humps and in this vicinity. 

The fact that the interval between regions of constructive interference 
is inversely proportional to the frequency increment 5w leads to the con- 
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elusion that, as this increment is allowed to approach zero, the heat 
period grows without limit. The frequency group finally becomes con- 
tinuous, and the resulting function has only one region of constructive 
interference in the entire time scale from minus to plus infinity. This 
reasoning assumes, of course, that each lre(|uency comix)nent in the 
group endures throughout the entire time scale; that is, it represents a 
true steady state component. 

The limiting form for the envelojx' function F{/), resulting from 
letting approach zero, is readily evaluated. Since the width Aw of the 
group remains C(mstant, n and 5w must vary inver.sely as the limit 
^ ^ ► 0 is carried to completion. I'lie trigonometric sine in the 
denominator of Kcj. Ov>9 may be replaced by its argument, so that the 
limiting form of this function is readily seen to be 

Aw 

sin - i 

FU^ = ■ ■ [.Ul] 

.Aw 



This is the function discussed in .\rt IS and plotted in Fig. 17, except 
that here 


V = 



[342] 


The resultant group function 


m = 


sm 




Aw 

V T' 


COS Wo/ 


[343] 


is now entirely transient in nature: that is, it never repeals, but has only 
one region of constructive interference. lk*yond this region the comjx)- 
nents in the group (which are now infinite in number) interfere de- 
structively forever. 

Since the phase angles of all the components are chosen equal to zero, 
the region of constructive interference lies at the time origin. If in the 
argument of each cosine term in E(|. 335 a phase angle —0 is inserted 
which has the following frequency dependence 

0(w) = (w — W())/o • [^44] 

the variable / in the above formulas for the envelope function is replaced 
by (/ — /o)- The region of constructive interference then occurs in the 
vicinity of the point t = /o- If the phase angles of the components in the 
group are given random values, the form of the resulting interference 
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pattern hiHomcs vx*ry difllcult to determino and in general exhibits no 
well-defined region of constructive interference. 

I'he fact that a continuous group of frec|uency com|X)nents gives rise 
to a resultant function having a transient nature suggests that if a 
similar limiting i)rocess is carried out with the Fourier series, a means 
will he obtained for analytically representing an arbitrary' nonperiodic 
function. This suggestion is followed in the following article. 

19. The Fourier integral 

The essence of the discussion in the preceding article may be sum- 
marized thus: Whereas the linear super{X)sition of a group of discrete, 
uniformly spaced, frefjiiency comj)onents (finite or infinite in number) 
gives rise to an interference pattern of a periodic nature, this pattern 
assumes a transient cnaracter when the frequencies in the group are 
continuou>ly distributed. 'I’he resulting so-called conlitiuoiis spectrum 
should be thought of as a line spectrum in which the spacing of the lines 
is allowed to approach zen^, and the transient character of the resulting 
time function may be thcnight of as the limiting form of a periodic func- 
tion for which the period has become infinite. That these two limiting 
processes are (insistent is evident from the fact that the spacing of the 
lines in the spectrum of a periodic function equals its lunflamental 
frecjuency, which is the ir.verse of its period. It may further be helpful in 
this connection to recognize that the period i which is the reciprocal of the 
spacing of the lines) is equal to the line density expressed as the number 
of lines jier cycles j)er second. .\s the period is allowed to grow without 
limit, the line density grows without limit, so that finally the spectrum 
lieGomes a continuous one and the function never repeats; that is, it 
becomes a transient function. 

Hv the carrying out of this limiting process in terms of the Fourier 
series Fq. 175 and the relation 176 for its coefiicients (spectrum function), 
an analytic means is obtained for the representation, in a closed form, 
of an arbitrary transient function. The value of such a mathematical tool 
in connection with various engineering problems is readily appreciated. 

Repeating, for convenience, the mathematical statement for the 
periodic case* 

/(/) = f [345] 

ri = — ao 

= r [346] 

ZTT t/ — *-/«i 

♦Tlic fundamental angular frciiuency is in the present article denoted by wi in order that 
the symbol w may be used to denote any arbitrary angular frequency. 



SIS 


FOURIER SERIES AMJ IM EORALS 


lUi. vn 


one should obser\'e that the limiting process indicated by r — ► » or 
wi — > 0 evidently is accompanied by a„ — ► 0. I'hat is, as the line spacing 
in the spectrum is allowed to become smaller and smaller, the amplitudes 
of the harmonic components also become smaller and smaller. However, 
it may be expected that the ratio an apj^roaches a finite limiting 
function. Thus, before carrying out the limiting process, it is expedient 
to rewrite Eqs. 345 and 346 in the fomi 


m = L 




5(;7aji) 


(-) = 5 ?- r dt 


[347] 

[348] 


In Eq. 347 the symbol stands for the increment in the variable 

;/wi, which is equal to the line spacing and hence equal to ct'i. 

The limiting process is now formally indicated by 


T — > OC 


Here it should be recognized that a new xariable for the frequency, a 
continuous variable, must be introduced to take the place of the dis- 
continuous variable uoji. This new variable (denoted by w) refers to any 
finite angular frequency in the continuous spectrum just as the discon- 
tinuous variable wwi does in the line spectrum. The introduction of the 
new symbol may be regarded as a convenient way of avoiding the ap- 
pearance of the quantities n and wi, which become improper in the limit. 
However, cj = itwi remains a proper variable and still refers to the 
frequency of any harmonic component in the limit precisely as it does 
before this limit is carried to completion. 

If this limiting process is thought of as being carried out in steps 
through doubling and redoubling of the period t, it becomes clear that 
each time t is doubled, any specific harmonic comjxment doubles its 
order n. If attention is fixed upon a sp)ecific harmonic frequency, n and 
031 must vary inversely (that is, noji must remain constant) as the period 
is increased. The frequency increment 6(«wi) in the limit is formally 
replaced by the differential and the ratio an/oji becomes a finite 
function g(w) of the continuous frequency variable w. This function 
expresses the variation of the harmonic amplitudes in the limit. 

It remains to recognize that as the limiting process is carried to com- 
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pletion, the summation in Eq. 347 becomes an integration. The final forms 
of the relations 347 and 348 are 


11 

g(o) do 

30 

[350] 

= t 

3 

T 

[351] 


This heuristic derivation of the relations 350 and 351 does not establish 
their correctness on a rigorous mathematical basis, but from an en- 
gineering [K)int of view a rigorous proof may properly be omitted since 
the priiK'ipal interest lies in the interpretation and use of these forms. 

It may be pointed out once more that the complex harmonic ampli- 
tudes are not given by gfcj) but by gijiA do. Since g(o) is finite, and do is 
the symbolic representation of a quantity which is regarded as becoming 
vanishingly small, the harmonic amplitudes are also vanishingly small. 
However, just as b[}toi ) denotes a constant spacing in the line spectrum 
of a periodic function, so do. which represents 5{;zwi) as the limiting 
process is carried to completion, must, at any stage of this process, 
likewise be regarded as a constant. Hence g(o) is proportional tog(aj) do. 
so that a plot of g(o) versus o shows the correct variation of the harmonic 
amplitudes with frequency even though all these amplitudes are vanish- 
ingly small. 

Equation 350 is called the Fourier integral representation for the 
function /!/). The function g(co) is called the Fourier transform of /(/) 
and, reciprocally, /(/) is called the inverse Fourier transform of gio). 
The second of the pair of integrals 350 and 351 transforms a time func- 
tion /(/) into its equivalent frequency function g(a;), and the first of these 
integrals reverses the proce.ss. The second integral analyzes the time 
function into a spectrum, and the first integral synthesizes the spectrum 
to regain the time function. g{o) represents the function in the frequency 
domain just as/(/) represents the function in the time domain. One may 
also regard Eqs. 350 and 351 as representing simultaneously a pair of 
integral equations and their mutual solutions. 

The graphical interpretation of the entire process of Fourier analysis 
and synthesis in terms of the frequency spectrum as discussed in Art. 9, 
as well as the method of manipulation of the forms 350 and 351 in con- 
nection with various physical and mathematical problems, remains 
exactly the same as for the Fourier series. Hence there is nothing new 
to be learned in this respect. The essential difference lies only in the fact 
that the spectrum function is continuous and that the synthesis of the 
spectrum is accomplished by means of an integral instead of a sum. This 
latter difference is actually an advantage because more formulas are 
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available for the evaluation of integrals than for the evaluation of sums. 
The principal advantage, however, lies in the fact that the Fourier 
integral is capable of representing transient functions. 

The conditions under which this representation is possible are es- 
sentially the same as those which apply to the rej) resen tat ion of a periodic 
function by means of a Fourier series. These are the Dirichlet conditions 
as pointed out in Art. 7. A detailed difference in the form of these condi- 
tions arises from the fact that the fundamental range is now infinite 
instead of being finite. In view of this dilTerence, the condition 136 reads 

f j/(/)i (// shall be finite [352] 

At a discontinuity of the function /(/) the Fourier integral (like the 
Fourier series) also yields the arithmetic mean between the two im- 
mediately adjacent values of the function (as stated for the Fourier 
series by the relation 137). 

The approximation properties of the Fourier integral are likewise the 
same as those of the series. In order to show this, one may consider the 
integral 350 for finite limits. Thus, the function 

s{t) = J* g{u}) du) [353] 

is the analogue of the partial sum of a Fourier series, since it represents 
the synthesis of a finite portion of the sjx'ctrum. Substituting for g(a;) 
the expression given by Eq. 351, and using B in place of the variable /, 
one has after interchanging the order of the two integrations 


s{t) = -- f "/(«) de [354] 

Writing for the exponential 

^ [355] 

and observing that the cosine is even while the sine is odd, one finds that 

J'“ = 2j[“ cos u{e-l) [356] 

and hence that Eq. 354 may be written 

s{i) = - / fifi) de r* cosco(^ — /) rfw • [357] 

TT U— V) */0 


But 

r , . . sin a{8 — l) 


[35S] 
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SO that 


s(t)--=-f f(e) — - — 

TCU—m (d — t) 


de 


[359] 


which is analogous to the integral 245 for the partial sum of a Fourier 
series. 

In order to illustrate the application of this formula one may consider 
the function defined by 


f(e) = 0 

for 

«<-l 


/(«) = 1 

for 


[360] 

/(9) = 0 

for 

»>l 


Then the integral 359 yields 

1 

- sin aid — /') 

[361] 

s{t) = - 

- 

dd 

TT J 

~sr2 

{ff - 1) 


which may be written 

, 1 

piis: 

sin II 

[362] 

s{t) = - 

TT 

J —a(S 

dl( 

2-rt) U 


or 


[ p*(«/2-0sinw , 1 sin // , 

sm = - I du / du 

TT Jo u TT Jo ii 


[363] 


Utilizing the definition of the 5f-function ac.ording to Eq. 257, and 
obser\'ing that Si (—.v) = —Si (x), one obtains the result 



Si a 




[364] 


Figure 29 shows a plot of this result for the choice a = 167r/5. The 
separate 5f-functions are also drawn (dotted) in order to illustrate more 
clearly how the resultant function is obtained. The similarity is very 
evident between this result and that shown in Fig. 21 for the partial sum 
of a Fourier series representing the periodic repetition of the present 
function /(^). In fact, the approximate evaluation of the Fourier sum 
for large values of n given in Art. 13 in connection with the problem 
illustrated in Fig. 21 likewise leads to an expression in terms of the 
5t-f unction. 
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The characteristic 18 per cent overswing at the points of discontinuity, 
yielding in the limit a — > « the Gibbs phenomenon, is again in evidence, 
just as it is for the Fourier series. Equation 364 and the corresponding 
plot in Fig. 29 also illustrate the fact that the value of the Fourier integral 
at the Joints / = ±5/2 equals the arithmetic mean between the two 
values of the function immediately adjacent to these t^^ints of 
discontinuity. 



Fig. 29. The Gibbs phenomenon ami the arithmetic mean properly are characteristic 
of the Fourier integral as well as of the scTies. 

20. Alterxativt: forms ix which the Foi^rier ixtegrals may 

BE WRITTEX 

The pair of mutually inverse integral relations 3vS0 and 351 may be 
written in a variety of different forms, with some of which it is well to 
be acquainted. One of the astonishing features about these relations is 
their almost complete identity in form. Thus, the integral 351, except 
for an interchange of the variables, w and /, dilTers from the integral 350 
only in the appearance of the factor l/27r and the reversed algebraic 
sign in the exponent of e. The first of these dilTerences may be removed, 
if removal seems desirable, by redelinition of the transform of /(/) as 

g*{(a) = VjjrgM [365] 

The relations 350 and 351 then assume the more symmetrical forms 

/(O = ^ r “ g* (a,) d <0 [366] 

V27r*^-- 


V2ir*'-- 


[ 367 ] 
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Alternatively, the appearance of a factor before these integrals may 
be entirely avoided through considering the transform to be a function 
of the cyclic frequency / (in cycles per second) instead of the angular 
frequency cj = 2 irf. Inasmuch as do) = It df, it is readily seen that when 

kU) = 27rg(a,) [368] 

the integrals 350 and 351 become rcspcctiv^ely* 

/KO = [369] 

«/ - * 

and 

ii(f) = /[/<(() [370] 

21. SpkCIAL forms YOR TIIK I'OURIER INTEGRALS WHEN THE GIVEN 
FUNCTION IS EVEN OR ODD 

If the Fourier integrals 350 and 351 are written in the more explicit form 
/(O = / aKco) [cos cot + j sin 0 ?/} dco [371] 

and 

j'icj) ^ f f{i){coscot — ysinw/j dt [372] 

then, if /(/) is an even function of /, the second term in the integrand of 

372 contributes nothing to the value of this inte.Lir al, and if /(/) is an odd 
function of /, the first term contributes nothing, in the first of these cases, 
moreover, g{co) must be an even function of w since the cosine is an even 
function, and in the second case, ^(w) is an odd function because the 
variable w is then contained only in the argument of the sine. Hence it 
follows that if /(/) is even, g(aj) is also even, and the Fourier integrals 
read 

m = x: g(w) cos cot dco [373] 

g((o) = “S to/ dt [374] 

*A different symbol is used here to denote the time function in order to avoid confusion 
with the symbol / used for the cyclic frequency. 
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If /(/) is odd, g{o)) is also odd, and the Fourier integrals have the form 


Z' ao 

/(O - J 1 gM sin o)t do) 

%J — ao 

[375] 

1 


[376] 


In both these cases the integration may alternately be extended over 
the range 0 to « and the integrals multiplied by 2. This factor of 2 may, 
if desired, be absorbed by the function .i?(a*) in the integrals 373 and 375, 
whence the factors for the integrals 374 and 376 become l/V instead of 
l/27r. The factor j appearing in the forms 375 and 376 evidently need 
not appear if the transform of /(/) for this case is defined as jgM and 
is denoted by a single symbol. 

These results show that if the given function'/(0 is real (although 
this is usually the case in practice, the validity of the Fourier integrals 
does not require it), ^(w) is real when/(/) is even, and purely imaginary 
when /(/) is odd. In general, for real functions /(/), one may decompose 
this function into its even and odd components /i (/) and f 2 (t) respectively, 
according to the relations 80 and 81, and then have for the corresponding 
transform 

gM = +jg2M [ 377 ] 

in which 

^ f fiiO = L r /,(/) cosw/<f/ [378] 

L'KU—Vi ZTTt/— OO 

and 

g2(") = sinutdt [379] 

22. Some elementary properties of the Fourier transforms 

In the Fourier integrals 350 and 351, if the algebraic signs of both 
variables w and / are reversed (co replaced by —a? and thy — /), the forms 
of these integrals remain unchanged except for the appearance of f{—t) 
in place olf{t) and g(— «) in place of g(w). To see this, one should note 
first that changing the signs of both w and t leaves the exponents of e 
unchanged; and, second, that the signs of do) and dt do change but so 
do the signs of the infinite limits, and since this amounts to an inter- 
change of the limits, the resulting signs of the integrals are the same as 
before. Hence, one obtains the result: 

If g(w) is the transform of f(t), then 
g(--w) is the transform of l{ — \) 


[ 380 ] 
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In the general case in which both /(/) and ^(co) may be complex, it 
should be observed that both sides of Eqs. 350 and 351 assume their 
conjugate values if the functions /(/) and .^(w) are replaced by their 
conjugate values /(/) and and if, in addition, the algebraic sign of 
either w or i (not of both) is changed. This change is equivalent to chang- 
ing the sign of j in the exponent of e. The statement, therefore, follows 
that: 


// g(cj) is the transform of f(t), then 
g(=poj) is the transfor^n o/ f(zbt) 


[381] 


As pointed out in Art. 20, the integral 350, which performs the inverse 
of the transformation giv^cn by the integral 351, is almost identical to 
the latter in form. If it were identical in form, one could omit entirely 
the distinction between the transform and the inverse transform of 
a given function, since two successive applications of the same trans- 
formation would yield the function itself. That is, the variables w and t 
might be interchanged, or in other words /(/) and ^(a>) might be regarded 
as each other’s transform. 

Although the actual situation is not quite so simple as this, it may be 
observ^ed that the forms of the two integrals do become interchanged if 
(a) Eq. 351 is multiplied by 27r, (h) the integral 350 is multiplied by 
l/ 27 r and the function g(oj) under this integral sign by 27r, and (c) the 
algebraic sign of either w or / is reversed. One may then interchange the 
variables w and t and regard g as the given function and / as its transform 
or equivalent frequency function. This conclusion is summarized by 
the statement: 


If gM is the transform of f(t), then 
f(±ci;) is the transform of 2xg(=F/) 


[382] 


In particular, if /(/) is even so that g(co) is even also, the process of 
changing the algebraic sign of either w or t may be omitted. It is then 
true that if g(aj) is the transform of /(/), one may reciprocally regard 
Ivg as a given function of time and have / represent the corresponding 
frequency function. 

The statement 382 may be given an even simpler form if it is made in 
terms of the functions/(/) and defined by the integrals 366 and 367, 
or in terms of the functions h{t) and g{f) defined by the integrals 369 and 
370. The factor 2x then does not mar the almost complete reciprocity 
existing between the pair of functions involved in the statement. 

Some problems make it convenient or necessary to change the time 
scale for the given function /(/) by some factor; it is then desirable to 
know the effect of this change on the corresponding transform. In order 
to see the effect, one replaces the variable t in the integral of Eq. 351 by 
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at. As a result, the differential dt becomes replaced by a times dl and the 
exponent of e becomes —jeoat. If now one replaces w by w/a, there results 

Hence it follows that : 


If gM is the transform t?/f(t), then 



is the transform of f(at) 


[384] 


This statement is true only if the factor a is positive. For example, 
for (2 = — 1 the statement 384 is evidently incorrect because it conflicts 
with the statement 380. This conflict arises because the algebraic signs of 
the limits change when a is replaced by —a. 

Another useful result follows from the obser\’ation that if the function 
/(/) in the integral 351 is replaced by the product /(/) • and the 

two exponentials in the resulting integrand are combined into one 
exponential with the exponent — 7 ( 0 ; =f a)o)f, the effect upon the function 
g is merely to replace its independent variable w by (a> =f wq)* Hence: 


If g(co) is the transform of f(t), then 
g(w Wo) is the transform of iit) ' 


[385] 


The latter function is the complex form of a sinusoidal function of 
angular frequency wo whose amplitude is modulated by the function /(/). 
The corresponding spectrum function is seen to be the same as that for 
/(/) except for a translation equal to the value of wq. Utilizing this result 
together with the principle of linear superposition, one may readily obtain 
the spectrum functions corresponding to /(/) • cos wo/ or /(/) • sin wq/. 

The complement to the statement 385 is obtained through assuming 
the function g(w) in the integral 350 to be replaced by Again 

combining the exponentials, one finds that: 


If g(w) is the transform of f(t), then 
g(<a) • is the transform (?/f (t ± to) 


[386] 


This result was observed in connection with the Fourier series in Art. 4, 
namely, that a displacement of the time function merely amounts to 
adding increments to the harmonic phase angles which are linearly 
proportional to the respective harmonic frequencies. 

Other useful relationships between the transforms are found through 
observing that it is permissible to differentiate or integrate with respect 
to the parameter contained in the integrand of either of the integrals 
350 or 351 as long as the resulting functions involved in these expressions 
still fulfill the conditions for Fourier integral representation. For example, 
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differentiating both sides of Eq. 350 with respect to t yields 

Repeating the process n times, one finds 

= j[‘ 

t 

from which one may conclude that: 

If gM is the transform ofi{t)^ Dien 

is the transform of the nth derivative of f(t) 


[387] 


[388] 


[389] 


The function p^iven by this wth derivative of /(/) must, of course, still 
fulfill the requirements for which its representation by means of the 
Fourier integral is valid. If the function given by the integral of /(/), 
namely, 

F(e) = f f{d) do [390] 


also fulfills these conditions, then since 


|[X>)^»]-f -/«) 


it may be inferred from the statement 389 that: 

If eM transform of f(t), then 

g(w)/ja} is the transform i{e) dd 


[391] 


[392] 


Under the same conditions, this statement may be extended to functions 
formed by successive integrations of /(/). 

Similarly, by differentiating or integrating E‘q. 351 with respect to w, 
one finds that : 


If sM transform of f(t), then the nth 

derivative of g(w) is the transform t?/ (— it)“f(t) 


[393] 


or: 


If g(w) is the transform of f(t), then 
I g (m) m is the transform of f (t)/ — jt 

«/ — eo 


[394] 


Again, the functions {—ftYfit) and f{t)/--jt must still fulfill the condi- 
tions for Fourier integral representation * 

*The conditions under which the statements 389, 392, 393, and 394 are valid are actually 
far less rigid than the Dirichlet conditions provided one interprets the results in the light 
of Art. 24. 
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23. The transform of a product and the interpretation of 

POWER PRODUCTS AND EFFECTIVE VALUES FOR TRANSIENT 
FUNCTIONS 

When the given function /(/) is expressed as a product of two 
component functions as 

m =/i(/)/2(/) [395] 

one is interested to know how the transform may be expressed in terms 
of the individual transforms of the components /i(/) and / 2 (/). 

It is helpful to observ’e that this problem is very similar to that of 
determining the expression for the coefficients in the resultant series 

te 

y = yiy-i = Z ars’' [396] 

r = — oo 

in terms of the coefficients in the component series 

yi = f [397] 

m — — =o 

and 

>-2= L bnz’' [398] 

n ** — ao 

Forming the product of these two series, one has 

y= i i a„bn!r+’' [399] 

m = — • n = — • 

or, letting 

m + n = r [400] 

one may write this as 

y= i ( i. a„br-„)z^ [401] 

in which the substitution of the summation index r for n is permissible 
since the summation over m is independent of that over n, A comparison 
of Eqs. 396 and 401 reveals that the desired relationship reads 



ar = 

m 

dmf^r—m 

[402] 

Similarly, if 





A(0 - / 

— 00 

[403] 

and 


Mt) = j 

f g 2 {'>) dv e’'^ 

' — « 

[404] 



AH, 23] 


POWER PRODUCTS AND EFFECTIVE VALUES 


529 


are the Fojurier integral representations for the component time functions 
in Eq. 395, one may write 


%J — sot/ — ao 

[405] 

since /x and v are completely independent variables. 

It is now possible to make the change of variable 


V = oj — /x; dv = db) 

[406] 

because m is a constant parameter as far as the integration on 
cemed in Eq. 405. 

Equation 405 then becomes 

V is con- 

/(/) = f do} f ^i(/x)^2(w - n)dne^^^ 

U — ao U — X 

[407] 

If this is written 


/(0= 

•/ — « 

[408] 

the transform of /(<) is seen to be given by 


g(«) = y*^gl(M)g2(“ - ti) dn 

[409] 


which is the desired relation. It is entirely analogous to the result ex- 
pressed by Eq. 402 for the corresponding problem in terms of series. 


Since there is no need for distinguishing between the functions /i(/) 
and / 2 (/) in this argument, it is evident that Eq. 409 remains true if the 
subscripts 1 and 2 are interchanged. 


In an exactly analogous fashion one finds that if 



X 

[410] 

and 



S' 

II 

— to 

[411] 

then the inverse transform of the function 


g(«) = Xi(")52(«) 


[412] 



sso 


FOURIER SERIES AND INTEGRALS 


iCh. VII 


is given by* 

/(/) = ^ fi ie)f 2 H-e)de [413] 

Here, also, the subscripts 1 and 2 may be interchanged. 

Expressing/(/) in this last result in terms of its Fourier integral, one has 

(“k 2 (‘o) doi C''"' = ^ /i (®)/ 2 (< - 6) de [414] 

Setting / = 0 and subsequently substituting the symbol t for B in the 
integral on the right of Eq. 414 yield the result 

/* = 2Tr C gi (t*))g2(=Ea>) 

*/ 30 */ 30 

in which the statement 380 is also used. 

This relation is analogous to the result stated for the Fourier series 
by Eqs. 190 and 196, in Art. 10. In other words, the left-hand side of 
Eq. 415 may represent the integrated power product for some physical 
system, whence the right-hand side expresses this integrated power in 
tenns of the transforms corrcsjx)nding to the given time functions. The 
relation 415 is applicable to transient functions in the same way that 
Eqs. 190 and 196 are to periodic functions. 

If, in particular, 


/2(=F<) =7i(=f0 

[416] 

then, according to the statement 381 


g2(=Fw) = |i(±w) 

[417] 

or 



[418] 

The relation 415 for this special case reads 


f \f dt = 2t f |gi(«)|^da) 

*/ — ee t/ — 00 

[419] 


which for transient time functions expresses an analogous relation to 
that stated by Eqs. 200 and 201 for periodic functions. The square root 
of the value given by Eq. 419 may be called the effective value of the 
transient time function. According to the definition ’of the effective 
value of a periodic function, one forms the square root of the mean of 
the integrated squared values of that function over a period. The interpre- 

*Tlie mathematical procedure described by this integral is called “ convolution ” (in the 
German literature the term used is “ faltung ”)• 
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tation of the relation 419 offers a complete parallelism between the 
periodic and the transient cases. 

Finally, it is possible to form another relationship which at times 
becomes practically useful. In Eq. 415,/2 (=f/) is any time function, and 
is its transform with the reversed algebraic signs of its inde- 
pendent variable. If the time function is chosen to be 2x^2 (=f 0> then, 
according to the statement 382, its transform with reversed signs of its 
variable is/of^Fco). Since the corresponding signs of the variables t and 
oj are now alike, the signs may be dropped, and one has in place of 
Eq. 415, 

/* — f ["^ 20 ] 

t/ 00 */ 30 

In connection with this relation it should be carefully observed that 
the functions and ^2 are understood to be the Fourier transforms of 
the functions /i and /o, and the latter are the inverse transforms of gi 
and g 2 ' The reason for this emphasis is that the reader may make the 
mistake of regarding /o (a;) as the transform of g 2 (J) because the latter is 
written as a function of / and the former as a function of oj. This con- 
fusion may best be avoided through adopting an entirely different symbol 
for the independent variable, and using the same symbol for both inte- 
grations since the distinction between given function and transform is 
expressed respectively by the symbols / and g, and has nothing to do 
with the symbols used for the independent variable or variable of inte- 
gration. The preferable form of Eq. 420, therefore, reads 

f Mx)g 2 {x)dx= f gi{x)f.i{x) dx [421] 

24 . Some illustrative examples; the singularity functions 

The first function to be considered is defined by 
/(/) = 0, for / < 0 
/(/) = for / > 0 

This function is illustrated in Fig. 30. According to Eq. 351 its transform 
is given by the integral 

^ dt [423] 

which yields 

1 

2Tr(a +yco) 



g(«) = 


[424] 
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This result may be written 

«(") = 2.(a “+a,*) 



Fig. 30. An approximation to i 

the unit step by a decaying According to the relations 377, 378, and 
exponential. 379, these are the'transforms of the even 


and odd components respectively of the 
function /(/). The odd components are shown plotted in parts (a) and 
(b) of Fig. 31. Both these components are given by for / > 0, 

but fi (t) is symmetrical about the vertical axis at the origin, whereas 



(a) (b) 

Fig. 31. Even and odd components of the function of Fig. 30. 


/2(0 is antis)anmetrical. The results given by Eqs. 426 and 427 may 
alternatively be obtained from the integrals 378 and 379 resp)ectively. 
Since the integrands in these integrals are even functions, the same 
value is obtained if the integration is extended only over the range 0 
to 00 and the result multiplied by 2. Thus it must be true that 

~ - 2.(/+ J-) t428] 

and 


g 2 («) = " 2 “ sin (at dt = 


2ir{o? + (a^) 


which, incidentally, is a simple way of evaluating these two particular 
integrals (the usual process requires two successive integrations by parts) . 
These functions are shown plotted versus the ratio w/a in Fig. 32. 
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Hence the point unity, for example, corresponds to w = a, so that, as a 
is given smaller and smaller values, this point corresponds to smaller 
and smaller values of w. At the same time it should be observed that the 
ordinates of these curves are inversely proportional to a. As a is assumed 
to become smaller, the function gi (w) appears to become more peaked, 
until in the limit a 0 it degenerates into a single infinite ordhiate at 
w = 0, and is zero everywhere else. 



Fig. 32. The transforms of the even and odd parts given in Fig. 31. 


It should be observed, however, that the area under the curve ^i(«) 
is constant and independent of a, for it represents, according to the 
Fourier integral 350, the value of/i(/) for ^ = 0, that is, 

/l(0) = = J [430] 


As'a becomes smaller this area becomes more and more concentrated in 
the immediate \dcinity of the point w = 0, until in the limit a 0 it is 
contained within a region of vanishing width at. the origin. Since the area 
remains finite, the ordinate of gi(w) at w = 0 must clearly become 
infinite in this limit. It is quite significant that the function gi(«), there- 
fore, does not vanish for a = 0, as one might at first glance conclude by 
inspection of the function 426. 

From Eq. 427 it is, on the other hand, easily seen that 


limit [g 2 (")] = 

a— K) 


[431] 


This is the equation of a rectangular hyperbola. The manner in which 
this limiting function is approached by 1:2 («) as a is assumed to become 
smaller and smaller is readily visualized from an inspection of Fig. 32. 

Turning now to the given function /(/), one sees that in the limit 
a — > 0 this function is zero for / < 0 and equal to the constant value 
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unity for / > 0. In Fig. 30, the curve for / > 0 then no longer falls off 
with increasing time but maintains the ordinate at / = 0 for all positive 
values of t. The limiting forms of the even and odd components shown in 
Fig. 31 are also readily \isualized. /i(/) reduces to the constant H;/2(0 
equals the constant value for / < 0, equals the constant value +3^ 
for t > 0, and retains the discontinuity of unit magnitude at / = 0. 

This limiting function, delined by the relations 422 for a = 0, actually 
does not possess a Fourier integral representation, for it no longer fulfills 
the condition 352. As long as a has a nonzero value, however, no matter 
how small, the Fourier representation is possible, and hence, for a i)roper 
interpretation of the limiting forms of the functions .^i(w) and .? 2 (w), 
such a representation may be said to be possible even in the limit a = 0. 

With the use of Eqs. 373 and 375 one has 

X ao ^00 

(w) cos cot dco — J sin cot dco [432] 

According to the preceding discussion of the function it is clear 

that as a becomes very small, the total contribution to the first of these 
two integrals is due almost entirely to the values of the integrand in the 
immediate vicinity of the point w = 0. If the variable / is for the moment 
assumed to remain finite, then for a sufficiently small value of a, the 
function cos ot remains equal to unity over that small range in the 
vicinity of w = 0 which contributes almost wholly to the value of this 
first integral. In the limit « — > 0 this reasoning becomes exact, and in 
view of Eq. 430 one may, therefore, conclude that in or near this limit 
Eq. 432 is equivalent to 

fiO = \ f K 2 M sin cot dco [433] 

2 t/ — » 


Substituting the limiting value of the function as expressed by 

Eq. 431, one has 




which may be written 


fit) = + limit 

Z a — ► 30 


U 8P (f) 

[4341 

\ 1 sin w , 1 

[435] 


According to the discussion of the 57-function in Art. and with the 
help of Eq. 257 the last result may be written, 
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Hence it is clear that, except for the Gibbs phenomenon at the point of 
discontinuity, the Fourier integral representation for this function is 
valid even in the limit a = 0, provided the integrals are properly 
interpreted. 

It should be observed that one cannot simply set a. equal to zero in 
the expression 424 for the transform of/(/), for then ^(co) becomes iden- 
tical with and g\ (w), which contributes the value 3^ to the Fourier 
integral representation, is lost entirely. As long as a is retained as a 
small quantity, and discarded only after the proper interpretation of 
the various steps in the process of evaluation, no difficulties are en- 
countered. 

In order to illustrate this point from a slightly different angle, one may 
substitute the value 424 for g{<j}) into the Fourier integral 350 and have 

ZT t/ a 


With the sine and cosine equivalent of the exponential function, this is 


^ , 1 r * cos co/ , / r * sin 0 )/ , 

/(/) = -- / ^ \ r- do3 

Itt J a Jo) Itt J— * a -j- 


[438] 


The second of these two integrals remains proper in the limit a — > 0, and 
hence one may set a = 0 in the second term of Eq. 438 without further 
ado. This term is then the same as the second term in Eq. 434, and hence 
it is evident that the first term in Eq. 438 is supposed to yield the value 3^. 
The integral in this term, however, becomes improper in the limit a —> 0 
because the integrand then becomes infinite for w = 0. If, nevertheless, 
a is set equal to zero, one observes that the integrand is an odd function 
of oj, and since the limits of integration are symmetrical, the value of the 
hitegral, except for the difficulty in the vicinity of w = 0, should be zero. 
In other words, whatever value this integral may have must certainly 
be contriliuted by the immediate vicinity Ot the point w = 0. For this 
vicinity, which may be denoted by —p<o)<p, one may again set 
cos oj/ equal to unity (since for very small increments from w = 0, cos w/ 
differs from unity by a small quantity of the second order) and have 
for the first term of Eq. 438 


1 do) 

2TrJ-'pa jeo 



27r; L « — JPJa- 


[439] 


which is the correct value. However, the importance of retaining a up to 
the last step during this evaluation process should be noted. Until then, 
a plays an important role toward guiding the evaluation and preventing 
misinterpretations; after this step has been taken, a has served its pur- 
pose and may be retired without causing further difficulties. 
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The integration in Eq. 439 may be understood clearly if it is regarded 
as an integration in the complex plane. Thus, when 


f = a +ja 


this integral takes the form 


r*plane 

-(a^jp) 


JL 

2x7 J(a—i 


ia+jp) (ff 
t—jp) f 


2wj J(a—Sp) f 


j, fir^(o^’^jp) The sketch in Fig. 33 indicates the path of in- 

/ i . tegration in the complex f-plane. Inasmuch as the 

— quantities (a -f jp) and (a — jp) have the same 

\ magnitude, it is clear from the discussion of the 

f logarithm function in Art. 17 of the preceding 

I \j chapter that the value of the integral 441, the 

~ factor X/lTrj being omitted, is a pure imagi- 

I nary quantity, equal in magnitude to the differ- 

Fig. 33. The path of between the angle of (a +jp) and that of 

integration for the in- (« This net angle clearly approaches the 

tegral of Eq. 441 rele- value x as a is allowed to become zero, but it is 
vant to obtaining the to be noted that if a is set equal to zero to start 
unit st^ from its trans- with, the path of integration lies upon the imagi- 

Foi^er integir*^* 

equally well be regarded as given by — x. With 
a retained, this ambiguity is avoided, and as soon as the correct interpre- 
tation has been seen, a may be discarded. 

The second example to be considered is the function defined by 


Fig. 33. The path of 
integration for the in- 
tegral of Eq. 441 rele- 
vant to obtaining the 
unit step from its trans- 
form by the inverse 
Fourier integral. 


m 

= 0, 

for 


m 

= 1, 

for 


m 

= 0, 

for 

^>1 


This is the same function which is considered in Art. 19 in the discussion 
of the approximation properties of the Fourier integral, as illustrated by 
the plot of Fig. 29. The transform, according to Eq. 351, is given by 


1 6 


[ 443 ] 
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The similarity of this result to that given by Eq. 187 for the complex 
Fourier coefficients in the series representation of the function illustrated 
in Fig. 12 should be noted. Inasmuch as the latter represents the periodic 
repetition of the function considered at present, this similarity between 
the expression for the Fourier coefficients on the one hand, and the 
Fourier transfonn on the other is, of course, not surprising. It may be well 
to point out here that the symbol w in Eq. 187 represents the fundamental 
angular frequency, and hence the quantity ww in that equation (not 
just w) becomes the analogue of « in Eq. 443. The latter may be obtained 




f(t) 





Fig. 34. The rectangular pulse of unit height and its associated Fourier transform. 


from Eq. 187 through writing wi for w, and then applying the limiting 
process indicated in the expressions 349. With reference to Fig. 12, this 
process leaves the rectangular pulse at the origin but causes the adjacent 
pulses to move to infinity, thus yielding in the limit the function defined 
by the statement 442. 

The transform g(w) as a function of w evidently has the appearance of 
the dotted curve of Fig. 13, showing the form of an as a function of mo. 
The important difference in the present example is the fact that the 
spectrum function gioi) is continuous; that is, all frequencies are present, 
not just integer multiples of a fundamental frequency. Figure 34 shows 
both the time function /(/) and the corresponding transform or spectrum 
function g(co). This spectrum is a continuous one, whereas the spectrum 
shown in Fig. 13 is a line spectrum. It should be remembered, of course, 
that the amplitudes of g(co) are not the harmonic amplitudes. The latter 
are all vanishingly small since their magnitudes are symbolically indi- 
cated by the differential notation g(w) do. The function g(w), neverthe- 
less, shows how these amplitudes vary with w for any increment dw 
however small. This is true because, at any stage in the limiting process 
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indicated by the expressions 349, the differential spacing dw of the 
lines ” in the continuous spectrum is constant, just as the finite spacing 
(w in Eq. 187) is in the corresponding periodic case. 

In \dew of the statement 382 it is useful to observe that if the scale of 
ordinates in the plot of ^(w) in Fig. 34 is multiplied by Itt, the variables 
w and t for these plots may be interchanged. That is, 2irg{t) may be 
regarded as the given time function and /(w) as its transform. There is no 
need to reverse the algebraic sign of one of the interchanged variables 
w or / in this example, because the functions are even. Thus, a time 
function like the curv’e for g(oj) in Fig. 34 is seen to have a spectrum like 
the curve for /(/). This spectrum is, of course, also a continuous one, but 
the interesting feature about it is its finite extent. 



This result may be predicted on the basis of the discussion in Art. 18 
regarding the resultant interference pattern of a frequency group. It is 
pointed out there that the frequency group of Fig. 27 has an envelope 
function like that plotted in Fig. 28 and expressed analytically by Eq. 
339. As more and more lines are added to this frequency group, until it 
becomes a continuous spectrum of finite width like the function /(/) in 
Fig. 34, the envelope function of the corresponding interference pattern 
approaches the form given by Eq. 341, which when plotted has the 
appearance of g(aj) in Fig. 34. In other words, the function /(/) in Fig. 34, 
regarded as a spectrum function, is a continuous frequency group whose 
resultant interference pattern (time function) has the appearance of the 
function g(cj) in Fig. 34. 

By means of the statement 385 it is a simple matter to determine the 
spectrum function which results when the time function of Fig. 34 is used 
to modulate the amplitude of a sinusoidal function of arbitrary frequency. 
Figure 35 illustrates such a time function, namely, a cosine function 
enclosed by a rectangular pulse. The same figure also shows the corre- 
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spending transform, which consists of the linear superposition of the 
functions g(w — wo) and g(w + m)- These are obtained through multiply- 
ing the function gioj) of Fig. 34 by 3^ and displacing it respectively to the 
right and to the left of the origin by the amount ojo, which is the angular 
frequency of the time function within the interval —8/2 < t < 8/2. 

It is interesting to consider the way in which the spectrum function 
for this example changes as the duration 5 of the sinusoidal time function 
is increased. One should observe that the functions giu) — wo) and 
j^(a) + coo) consist essentially of a large hump w^hose amplitude is propor- 
tional to 8 and whose width at the base of this large hump is inversely 
pro{x)rtional to 5. As 5 becomes very large, the spectrum function of 
Fig. 35 assumes the form of two very tall slim humps, one at coq and the 
other at — coq. In the limit 5 oo the spectrum function is given by two 
lines at the points rbcoo, as one should expect from the fact that the single 
frequency coq alone then characterizes the resulting time function. 

A type of function very useful practically is readily derived from the 
function /(/) in Fig. 34. If the amplitude of this function is set equal to 
1/5 instead of unity, the area enclosed by the rectangle equals unity no 
matter what 5 may be. For a large value of 5, the rectangle is long and low, 
for a small value of 5, it becomes thin and tall. The transform g{o)) 
corresponding to this /(/) is given by the expression 443 divided by 5, 
that is, by 


.?(«) = 


1 





“I j 


[444] 


If 5 is now allowed to approach zero, /(/) degenerates into a single 
infinite ordinate at / = 0; that is, the rectangle has zero width and an 
infmite height but still encloses unit area. This limiting form of the 
function /(/) of Fig. 34 is called a unit impulse and may be denoted by 
Uo{t). Its transform is given by the expression 444 for the limit 5 — > 0. 
With this limiting value of the transform denoted by i;(aj), it is readily 
seen that 

J'(«) = [445] 


In other words, the transform of the unit impulse is a constant. 

In view of the preceding discussion of the function gi(<o) of Fig. 32 
for the limiting process a 0, this result may be arrived at in a different 
manner. The inverse transform of namely /i(0 of Fig. 31, becomes 
equal to the constant M for a^O; gi(&)) degenerates into a single 
infinite ordinate at u = 0, although the area enclosed by the curve for 
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giipy) remains constant and equal to According to the statement 382, 
the transform of lirgx (/) in the limit cy 0 is the constant Hence the 
transform of 2gi (/) for a — ♦ 0, which is an impulse enclosing unit area, 
is the constant l/27r, the same as ?(^*>)- In other words, the function 
2^1 (0 for the limit a — > 0 may be identified with the unit impulse 
and its transform may be identified with i»(a>) in Eq. 445. It is rather 
interesting that this conclusion should be true, in view of the fact that 
Wo(0» in the argument of the preceding paragraph, is approached by the 
rectangular pulse /(/) in Fig. 34, and the limiting form of 2gi{t) is ap- 
proached by the rounded pulse of Fig. 32. 

The fact that the transform of the unit impulse is constant and equal 
to l/27r may also be seen graphically. With reference to Fig. 34, if /(/) 
is multiplied by 1 5, then g{oi) is multiplied by 1/ 6 also. Its amplitude 
is then independent of 6. As 6 becomes smaller, the distance from the 
origin, co = 0, to the points a? = zb27r^6 becomes larger. For a very small 
6 (tall narrow rectangular pulse) this distance is so large that, over a wide 
range of frequencies, ^(oj) drops off ver>" little from its value of l/27r at 
o) = 0. Finally, as 5 approaches zero, the points a> = zh2x/5 move to 
infinity, and the curv'e for gios) becomes a horizontal line l/27r units above 
the oj-axis. 

Conversely, one may let ^(o?) in Fig. 34 approach the unit impulse. 
The area under this curve already equals unity because it represents the 
value of /(O), according to the Fourier integral 350. Making use of the 
statement 382 again by saying that (1 2ir)f(o3) is the transform of g{t)y 
and this time letting 6 approach infinity, one finds that g(t) approaches 
«o(/), and (l/2s-)/( w) approaches the constant l/27r. 

The next function to be considered is shown in Fig. 36. This is the 
integral form — » to / of the function /(/) in Fig. 34 multiplied by 1/6. 
Since the transform of the latter is given by Eq. 444, it follows from the 
statement 392 that the transform o^ the present time function is given by 


g(") = 


1 

2Trj(a 


/ . A 

sin 0 ? - 


\ "2 / 


[446] 


For small values of w this transform behaves like the g(«) of Eq. 424 
for a = 0, that is, like the transform of the time function defined by the 
relations 422 for a = 0. In other words, the transform 446 for the function 
illustrated in Fig. 36 has a nonintegrable infinity at « = t). This is due 
to the fact that the present time function does not fulfill the condition 
352, and the same diflSculty occurs as discussed in connection with the 
function defined by the relations 422 for a = 0. Since the method of 
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dealing with this difficulty is now understood, however, the transform 
given by Eq. 446 may be accepted as an integrable function. 

As 6 is now allowed to approach zero, the function of Fig. 36 assumes 
the form shown in Fig. 37. This form, however, is the same as that of the 
function defined by the relations 422 and illustrated in Fig. 30 for the 
limit a — > 0, as is also evident from the fact that the transform 446 for 
5—^0 becomes identical with the transform 424 for a —> 0. As indicated 
in Fig. 37, this limiting form of the function of Fig. 36 is denoted by the 
symbol 



Fig. 36. The integral of a rec- Fig. 37. The function of 

tangular pulse similar to that of Fig. 36 where 5 is allowed 

Fig. 34. to approach zero. 


Since the function of Fig. 36 is the integral from — « to / of the 
1/5-multiplied time function of Fig. 34 for any value of b however small, 
one may regard the function of Fig. 37 as representing the integral of the 
unit impulse Wo(0* Symbolically this fact is expressed by 

«-i(0 = L Wo(0 [447] 


in which the function «_i(/) is called the unit step function or, more 
briefly, the unit step. It is defined by the relations 


w_i (/) =0, for / < 0 

«_i(/) = 1, for / > 0 


[448] 


and hence is identical with the function defined by the relations 422 for 
a = 0. It plays an important part in the Heaviside Operational Calculus, 
but in the more recent expositions of this subject the unit impulse func- 
tion Uo{t) is found to be of greater value, chiefly because its transform is a 
constant. 

From these discussions it becomes clear that the unit impulse may 
alternatively be expressed as the time derivative of the unit step, that is. 


[ 449 ] 
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This is in agreement with the statement 389 since the transform for the 
unit step is given by Eq. 424 for the limit a 0, and this result multiplied 
by 7 * 0 ? reduces to 1/2 t. 

From a conserv^ative mathematical point of view the differentiation 
of a function having a discontinuity is considered not permissible and is 
regarded as having no meaning. In view of the present discussion, how- 
ever, it is clear that such operations are permissii)le provided they are 
properly interpreted. Thus it is possible to deline functions and corre- 
sponding transforms for successive derivatives of the unit impulse. The 
first derivative is written 

«i(/) = ^ [450] 


and its transform, according to the statement 389, is 


joivM = 


2ir 


[451] 


The function «i (/) is called the unit doublet. The 
reason for this name is clarified by reference to 
Fig. 38 which illustrates graphically the manner 
in which the derivation of this function is to be 
interpreted. Starting with the function shown 
in this figure, one obtains the unit doublet U\{l) 
by passing to the limit 5 0. Since this limiting 

process may be indicated symbolically through 
replacing 5 by the differential time increment dt, 
the correctness of this graphical interpretation 
may be seen analytically from the fact that 

dup _ Up{t -f dt) — uo(t) 
dt dt 



s 



Y 

1/6^ 

1 





o 


A ^ 

i/i* 

1 






Fig. 38. A rectangular 
pulse doublet. 


[452] 


or 


dup 

dt 


Up 




dt 


[453] 


When the statement 386 is applied, the corresponding transform is seen 
to be 


dt 


v{u)ju dt 
Jt 


= juv{6>) 


[ 454 ] 


which agrees with the discussion surrounding Eqs. 450 and 451. 
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The unit doublet is evidently equivalent to a pair of equal but opposite 
impulses which are immediately adjacent to each other at the origin. 
The net effect may be likened to that of a couple in mechanics, and for 
this reason the term couple is sometimes used in place of the term doublet 
to designate the function Ui{t), 

It should be observed that the two impulses involved in this interpre- 
tation are not unit impulses. As shown in Fig. 38, the area enclosed by 
each rectangular pulse has the value 1/5. In the limit 5-^0 this area 
becomes infinite, and the resulting impulse is seen to be one of infinite 
value rather than one of unit value. These considerations are pertinent 
to the proper interpretation of the expressions 452 and 453 inasmuch as 
u^/dt symbolically represents an impulse of infi- 
nite value. 

Through continuing in the same way a se- 
quence of functions may be formed. The next in 
order is defined as 


i: 


dui (Puo 

"=<'> -HF-W 


[455] 


and its transform, according to the statement 389, 
is given by 


2/4’ 

li 


(jQ>yv(u) = 


Irr 


[456] 


Fig. 39. A rectangular 
pulse triplet. 


The function U2(0 rnay be interpreted graphically as the limit of 
the time function sliown in Fig. 39 as S is allowed to approach zero. It is 
equivalent to two equal but opposite doublets of infinite value centered 
about the origin and separated by the increment 5 = dt. Thus one may 
write 


dui 

«2«) = ^ = 




dt 


[457] 


and, making use of the statement 386 again, have for the corresponding 
transform 


— j(av(u))e 
_ 




[458] 


It is also possible to extend this sequence of functions in the opposite 
direction by successively integrating «o(0- 

The first integration yields the unit function «i(/) or unit step. The 
next integration yields a time function which is linearly proportional to 
the time, like the current through a pure inductance when a constant 
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voltage is applied. In connection with practical problems there is little 
actual use for any of the functions in this sequence except the unit 
impulse and the unit step, although a recognition of the availability 
and the interpretation of the general sequence of functions together with 
their transforms proves to be a useful tool in the application of Fourier 
integral analysis to various practical problems. 

According to the preceding discussion it should be clear that the 
transform of the general function u„(t) in this sequence is The 

sequence is referred to as the singularity functions, the unit impulse and 
the unit function being singularity functions of the order zero and minus 
one respectively. It should be observ^ed that the singularity functions 
multiplied by lir are the inverse transforms of the integer powers of {jo)). 
Inasmuch as the inverse transform of ( 7 ^)” cannot be found in the usual 
fashion because the integral 350 becomes improper, the discussion in 
this article may essentially be regarded as an ^ interpretation process 
which avoids this difficulty and demonstrates the existence of such 
integrals under suitable limiting conditions. 

25 . The error function and the seoihcnce of singitlarity 

FUNCTIONS BASED UPON IT 

It has been seen from the preceding article that the unit impulse and 
its transform may be obtained through applying to other suitably chosen 
functions, besides the rectangular pulse, a limiting process by which 
they degenerate into a single infinite ordinate enclosing unit area. Corre- 
spondingly, the entire sequence of singularity functions and their trans- 
forms may be derived through applying suitable limiting processes to 
a variety of properly chosen functions. One of the most interesting of 
these is the so-called error fuftclion which has the form 

/(/) = [459] 

According to the integral 375, the transform of the error function is 
given by 

g(«) = ~ e"^**cos«/d/ [460] 

The integration 3 delds 

— ctf*/4a 

Here it is, incidentally, interesting to observe that for the choice a = 
one has 


[ 462 ] 
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and 

«*/2 

- vs 

In other words, the time function and its transform are identical except 
for a scale factor. In particular, if the transform is defined as ^(«), 
according to Eq. 365 and the Fourier integrals 366 and 367, the scale 
factor becomes unity and the time function is identical with its trans- 
form. This situation may also be achieved by using the integrals 369 
and 370, for which the transform (considered as a function of the cyclic 
frequency / = 03 / It) is defined in terms of ^(w) by Eq. 368. Thus, with 
a in Eqs. 459 and 461 equal to and Eq. 368 being used, the time func- 
tion reads 

hit) = 6“'*^ [464] 

and its transform becomes 

!(/) = [465] 

m 


t/^ 

Fig. 40. The error function used in consideration of the singularity functions. 




Returning now to the Eqs. 459 and 461, letting a = v/ a and multiply- 
ing the resulting time function and its transform by the factor 1/Va, 
one has for the time function 


and for its transform 


fit) = ^ 

V a 

[466] 

o«*/4» 

*<")- 2, 

[467] 


This time function is plotted in Fig. 40. Since the area imder the time 
function equals 2x^(0), it is clear that this area equals unity independent 
of the value of the parameter a. From Fig. 40 it is seen that as a becomes 
smaller the curve for/(/) becomes taller and narrower and, in the limit 
a — > 0, has the character of the unit impulse wo(0* At the same time one 
recognizes, from the form of Eq. 467, that g(«) in the limit a 0 becomes 
equal to the constant value l/2ir, and hence identical with the transform 
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v(u)) of the unit impulse. Thus the first of the sequence of singularity 
functions is obtained from /(/) of Eq. 466 for the limit a — >0, and its 
transform is found from Eq. 467. 

The remainder of the singularity functions are obtained from the 
successive derivatives of the function /(/) of Eq. 466 for the limit a — > 0, 
and the corresponding transforms of this sequence of functions follow 



Fig. 41. The first derivative of the error function used in consideration of the unit 

doublet. 

from the use of the statements 389 and 392 in connection with the 
transform 467. For example, the first and second derivatives of Eq. 466 


read 

dt~ 

[468] 

and 

d^f - a) 

dl^ ~ 

[469] 

whereas the corresponding transforms are given respectively by 



«'(») =>«(«) = 

[470] 

and 


[471] 


These time functions are shown in Figs. 41 and 42, from which it is 
clear that for a 0 they have the general character of the singularity 
fimctions Wi(/) and %(/). In other words, the function 468 approaches 
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the unit doublet as a approaches zero, and the function 469 approaches 
the singularity function of order two. 

It is interesting to compare the time functions of Figs. 34, 38, and 39, 
respectively, with those shown in Figs. 40, 41, and 42, and observe that, 
although the one set of curves is rectangular and the other set is smooth, 
both sets approach the same sequence of singularity functions when 
suitable limiting processes are carried out. This is evident from the fact 
that the transforms 467, 470, and 471 become identical with 
j(A}v{(a), and (jo})h(ci)) in the limit a — > 0. 



Fig. 42. The second derivative of the error function used in consideration of the 
singularity function of order two. 

Since the error function and all its successive derivatives are smooth, 
use of them in the derivation of the singularity functions does not in- 
volve the mathematically doubtful steps encountered when the rectangu- 
lar pulse and its derivatives are used for this purpose. 

26 . Relation to contour integrals 

In applying the Fourier method of analysis to practical problems, one 
frequently encounters functions g(w) which are quite complicated, and 
for which the evaluation of the synthesis integral 350 presents some 
difficulty. Although the function g(«) may be complex, it should be clear 
that the integration is carried out with respect to the real variable «, 
and hence is essentially an integration of a function of a real variable. 
Inspection of the integral representation of g{(a) according to Eq. 351, 
however, discloses that the variable w occurs only in the exponent of 
the exponential function and is there associated with the operator j. 
This fact shows that the transform g(w) may alternatively be regarded as 
a function of j<a and written in the form g(j(a). If this is done, and the 



548 


FOURIER SERIES AND INTEGRALS 


ICk. VII 


synthesis integral 350 is rewritten in the modified form, 

/(<) = jXy- 

it appears that this process of modification may be carried a step further 
through introducing the formal change of variable 

X =y« [473] 

writing Eq. 472 in the form* 

m = \ g{\)d\^^ [474] 

J U~j « 

and, regarding g(X) as a function of a complex variable, 

\ = <T + joi ' [475] 

evaluating the integral 474 by the method of complex integration dis- 
cussed in Art. 15 of the previous chapter. This method of integration, 
which is thus made available for the evaluation of the synthesis integral, 
holds promise for the simplification of many problems which present 
almost insurmountable difficulties unless one is exceptionally skilled in 
the art of real integration. 

Before this method of dealing with the synthesis integral may be 
utilized, however, it is necessary to clarify several significant points 
which, in the above formal steps, are left in a somewhat doubtful state. 
First it is necessary to assure oneself that the complex transform g(X), 
which is obtained from the ordinary Fourier transform by the simple 
expedient of replacing jw by a complex variable X, is in fact the analytic 
continuation of the function g(j«) into the complex domain. Sudi a 
justification is called for because the Fourier integral 351 defines the 
complex function g only in terms of the real variable w, or one may say 
that it defines the function g(X) only for values of X on the imaginary 
axis of the X-plane. In other words, the Fourier integral 351 does not 
establish the existence of the function g(X) for all points in the X-plane. 
In fact if one writes the integral 351 in the form 

[476] 

one immediately recognizes that this integral converges for / < 0 only 
for points in the left half of the X-plane and for / > 0 only for points in 
the right half of this plane. 

^Rewritten in terms of the variable X, the Fourier integrals are commonly referred to as 
Laplace’s integrals and g(X) as the Laplace transform 
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The existence of g(\) for all complex values of X is, however, readily 
established with the help of the principle of analytic continuation and the 
uniqueness theorem for analytic functions (see Art. 11 of Ch. VI). Thus, 
suppose that a function g{ju)) is given (that is, the function g(X) for 
X = ju>) and let the problem be to find its analytic continuation into the 
X-plane. Suppose furthermore that somehow a function F{\) of the com- 
plex variable X is found which is identical with g(X) for all points on the 
imaginary axis (actually it is only necessary that F{\) and g(X) be 
identical for all points on an arbitrarily small portion of the imaginary 
axis or for an infinite number of discrete points having a limit point on 
the imaginary axis). Then, according to the uniqueness theorem, the 
functions F{\) and g{\) are identical everywhere in the X-plane, and 
hence F(X) or g(X) is the desired analytic continuation of g(yco). 

Returning now to the integral 474, one finds that interpretation of it in 
terms of the method of complex integration raises a second pertinent 
question. The method of complex integration requires that the path of 
integration be in the form of a closed contour. According to the integral 
474, the path is the entire imaginary axis, from minus infinity, through the 
origin, to plus infinity. Since infinity is regarded as a single point (this 
view is most easily appreciated through considering the complex plane 
replaced by its associated complex sphere), one may say that the closed 
contour requirement is met by the integral 474. A difficulty arises, how- 
ever, because the integrand has an essential singularity at the point at 
infinity, since it contains the factor In the immediate vicinity of such a 
singularity a function is capable of assuming any assigned values (see 
Art. 13 of Ch. VI), and hence the method of passing through such a 
point cannot easily be disposed of. Unless the path of integration in the 
integral 474 is closed by passage through or around the point at infinity, 
the methods of complex integration cannot be applied, and yet the proc- 
ess of supplying this gap in the path of integration must be accomplished 
in a way which does not affect the value of the integral. 

For the following argument it is necessary to assume that g(X) is a 
rational function and that, for large values of X, it vanishes at least as 
1/X. Modifications in the procedure which are called for when g(X) does 
not fulfill these conditions are more appropriately considered later. In 
the immediate vicinity of the point at infinity the integrand is then 
essentially represented by the factor e^VX. If the X-plane is, for the 
moment, regarded as replaced by its associated complex sphere, ac- 
cording to the method of stereographic projection, one now contemplates 
by-passing the point at infinity by means of a path increment in the form 
of a small semicircular detour concentric with this point. In the ordinary 
X-plane this detour corresponds to a semicircular path of very large 
radius with the origin as a center, as indicated in Fig. 43. This large 
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semicircle lies in the right or left half of the X-plane according to whether 
the point at infinity is by-passed on the right or on the left. 

In order that the detour shall contribute a negligible increment to 
the value of the integral, it is clear that c^^/\ must at all events remain 
bounded for points on the detour. For such 
points it is also clear that X has large values 
and that these values become infinite as the 
radius of the semicircular path about the point 
at infinity is made smaller and smaller, that is, 
as the radius K of the corresponding path in the 
X-plane of Fig. 43 becomes larger and larger. 
In order for \ to remain bounded, it is 
seen, therefore, that, for t < 0, the detour 
must lie in the right half plane where the 
real part of X is positive and that, for / > 0, 
the detour must lie in the left half plane where 
the real part of X is negative. 

It remains to show that, with this choice 
of detours for / < 0 and / > 0 respectively, the contributions to the 
integral 474 due to these added path increments are negligibly small. 
That is, one must show that the integral 

Jd. (477) 

extended over the semicircular paths as indicated in Fig. 43 has a negli- 

gible value for a sufficiently large value of R. For ix)ints on the semicircle 

X = = R (cos S +j sin d) [478] 

and 

j=jde [479] 

SO that the integral 477 becomes 

I=jf [480] 

Since the factor has unit magnitude, it is clear that the value of 

the integral 480 extended over either of the two semicircular paths is 
certainly less than it would be if this factor in the integrand were omitted. 
As far as the magnitude of the integral is concerned, one may also drop 
the factory and have 

|/| < J 



Fig. 43. The paths for 
replacing the Fourier in- 
tegral along the imaginary 
axis by a contour integral. 


[481] 
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According to the choice of paths for / < 0 and / > 0, it is observed that 
for / < 0 the limits of integration are from S = ir/l to0 = — 7r/2, and that 
for / > 0 they are from = 7r/2 to 0 = 37r/2. For either path, the expo- 
nent Ri cos 6 is negative, and since jcos^l ^ 1, one may state that for 
I < 0 , 


1/| < 2j 

1) 

[482] 

and for t > 0, 


pZir/2 / 

|/| < <2j^ 

-KU« ^ ’r(l - e-®'*') 

de = — ;;n 

< R\t\ 

[483] 


For any nonzero \t\, the value of \I\ may, therefore, be made arbitrarily 
small through choosing a sufficiently large value of R, 

According to the theory of contour integration, one recognizes that 
the radius R need lie cliosen only large enough so that the semicircular 
paths in Fig. 43 together enclose all the poles of the integrand. Since 
is an entire function, these poles are those of the rational function g(\). 
As a simple illustrative example, let the time function /(/) be the unit 
step w_i(/) with the transform ^(X) = l/27rX. The s5mthesis integral 
then reads 

1 pj CO 


In evaluating this contour integral one must again be reminded of the 
fact (discussed in Art. 24) that the transform of the unit step function 
is to be regarded as the limit of the function 


«(x) = 


1 

27r(a -|- X) 


[485] 


for a — » 0. The pole of this function lies at the point X = — a, and since 
a is an arbitrarily small but nevertheless nonzero quantity, one observes 
that the pole of the integrand in the integral 484 must be regarded as 
lying, not at the origin of the X-plane, but slightly to the left of this 
point. The closed contour for / < 0, therefore, does not enclose this 
pole, but the one for / > 0 does. Inasmuch as the residue of the integrand 
in this pole is unity (see Art. 15, Ch. VI, for the evaluation of residues), 
one readily recognizes that the integral 484 correctly represents the 
unit step function. 

The necessity of writing the transform for the unit step in the form 
given by Eq. 485 and considering the limiting process indicated by 
a 0 may be avoided if the path of integration in the vicinity of the 
origin is regarded as modified as shown in part (a) of Fig. 44. Instead of 
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passing through the origin, the path along the imaginary axis avoids the 
origin by passing to the right of it along a semicircular detour of vanish- 
ingly small radius. That a detour of this sort 
is necessary follows from inspection of the 
integral 484 inasmuch as a pole of the inte- 
grand lies upon the path of integration. How- 
ever, unless one is aware of the limiting proc- 
ess which, in the limit, causes this pole to 
be located at the origin, one cannot know 
whether to by-pass this pole on the right or 
on the left. A knowledge of the limiting proc- 
ess which is necessary for the proper inter- 
pretation of the Fourier transform for the 
unit step function is thus seen to be neces- 
S3.ry also for the removal of ambiguity in 
the reverse process of regaining this time 
function from its transform. 

It is possible for the integrand in the contour integral 474 to have 
several poles on the imaginary axis. A case of this kind arises when the 
time function has the form 

/(/) = «-i(/) • cos {(aol + 0) [486] 

which represents a steady sinusoid starting at / = 0. With the trigo- 
nometric function replaced by its exponential equivalent, the function 
486 may be written 

/(/) = w_i(0 • [487] 




t 


x*o. 






(a) 


f 




Fig. 44. Modification of the 
path of integration in the 
vicinity of poles of ^(X). 


Utilizing the statement 385, one finds for the corresponding transform 




I 


in which 




Xo =jt»o 


p->* 


1 


4t X + Xo 


[488] 


[489] 


The integrand in the contour integral for this g(X)-function evidently 
has poles at the points X = d=y«o on the imaginary axis. In the evalua- 
tion of this contour integral, these poles must evidently be by-passed in 
the manner indicated in part (b) of Fig. 44. The residues of the integrand 
in these poles are seen to be respectively 

— • and — — [490] 

4t 4t 

whence, observing agam the selection of proper contours for / < 0 and 
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/ > 0, one sees that the evaluation of the contour integral in this case 
correctly yields the time function 486 or 487. 

In each of the two examples just discussed, one observes that g(X) 
does fulfill the condition of vanishing at least as strongly as 1/X for large 
values of X. It is appropriate at this point to consider in greater detail the 
necessity for this condition. One should here recall the discussion in 
Art. 8 relative to the character of Fourier coefficients for periodic func- 
tions which either are discontinuous themselves or possess discontinuities 
in their derivatives of the first or higher order. It is pointed out there 
that if the time function is discontinuous, the Fourier coefficients can 
become smaller no faster than 1/ voi, in which v is the order of the harmonic 
coefficient and w is the fundamental angular frequency. If the function 
is continuous but its first derivative is discontinuous, the coefficients can 
become smaller no faster than and so forth. 

Since these statements must obviously remain true as the period of the 
periodic function is made larger and larger, they apply also to the Fourier 
integral representation of a transient time function and its transform. 
The singularity functions Wn(0> whose transforms are l/29rX” are ap- 
propriate examples of this property. Inasmuch as the converse of these 
statements is evidently also true, one observes that the restriction that 
g(X) shall vanish for large X at least as fast as l/X is seen to imply that the 
corresponding time function shall possess nothing worse than discon- 
tinuities. In other words, the method of contour integration is applicable 
to the evaluation of the synthesis integral only if the corresponding time 
function contains terms involving singularity functions of the order — 1 
or less. The impulse, for example, cannot be regained from its transform 
by the method of contour integration unless one employs special devices 
involving limiting processes similar to those used in the derivation of the 
transforms of such higher order singularity functions. 

From a practical point of view tliis restriction on the function g{\) 
is hardly serious since the behavior of a physical system never exhibits 
the properties of an impulse or its derivatives unless the data are de- 
liberately idealized. Moreover, one can, in such idealized cases, always 
apply a simple artifice to overcome the difficulty imposed by this re- 
striction. For example, suppose g(X) approaches a constant value for 
large values of X, implying that the corresponding time function contains 
an impulse. If the integrand is arbitrarily multiplied by 1/X, the state- 
ment 392 shows that the corresponding time function is replaced by its 
integral. Contour integration may then be applied, and the desired time 
function found through differentiating the result. In general, one may 
multiply the function g{\) by whatever power of 1/X is needed to obtain 
the proper behavior for large values of X and, after evaluating the contour 
integral, differentiate the result a corresponding number of times. 
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During this subsequent process of differentiation, one must observe 
the following precautions. If the time function resulting from the contour 
integration contains discontinuities, its derivative contains a corre- 
sponding number of impulses. For example, suppose a time function /(/) 
has a discontinuity of the value /i at / = /q. The first derivative of/(/) 
then contains the term h • «o(^ “ its second derivative contains the 
term h • iii{t — /o)> and so forth. Each discontinuity is treated in this 
manner whether it appears in the function /(/) itself or in any of its 
subsequently formed derivatives. Besides terms of this sort, the deriva- 
tives of /(/), of course, also contain terms representing the derivatives 
of the smooth portions of /(/). 

The second restriction which is placed upon ^^(X) in the above discus- 
sion of the resulting contour integration, namely, that ^(X) be a rational 
function, may with some reservations next be relaxed to the extent of 
allowing g(X) to be a meromorphic function. As pointed out in Art. 18 
of Ch. VI, this class of functions is more general than the rational ones 
in that the point at infinity may be an essential singularity. I'hus j^(X) 
is allowed to be a transcendental function, although its singularities in 
the finite X-plane must still be ordinary poles. 

This relaxation of the conditions imposed upon ^(X) requires further 
consideration of the process of contour integration from two aspects. 
These are concerned, first, with the effect of the essential singularity at 
infinity and, second, with the possibility of an infinite number of poles in 
the rest of the X-plane. A few simple examples will best illustrate how 
these matters may be dealt with. 

Suppose the time function is the rectangular pulse defined by the 
relations 360. Its transform is given by 


1 


sin X - 


dt = 


ttX 


[491] 


This is an entire transcendental function. Its only singularity is the one at 
infinity. Since the integrand in the contour integral 474 in this case has no 
poles at all, the entire process of evaluating this integral centers about the 
question of how the gap in the contour, which exists at the point at 
infinity, may be closed without affecting the value of this integral. 
Offhand, one may be tempted to conclude that the value of the integral is 
zero because the integrand has no poles. This conclusion i& false, however, 
inasmuch as it is based upon the tacit assumption that the closure of the 
gap in the path of integration at infinity is to be dealt with in the manner 
described for rational g(X)-functions. Such an assumption is presumptive. 

The clue which leads one in the right direction is found through re- 
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writing the function 491 in the form 


gW = 


2x\ 




MI2 _ 


xa/2^ 


whence the integrand in the integral 474 becomes 


2irX 




[492] 

[493] 


which, of course, is still an entire transcendental function. However, if 
the integrand is separated into two terms, each term is seen to have a 
simple pole at X = 0, and in the vicinity of the point at infinity to behave 
in the same manner as already described for contour integrals involving 
rational g(X)-functions except that the variable / is replaced respectively 
by (/ -f- 6/2) and (/ — 5/2). The paths which previously were chosen for 
/ < 0 and / > 0 are now chosen for (/ i 5/2) < 0 and {t ± 5/2) > 0 
respectively. Except for these changes, the integral for each term has the 
form of Eq. 484 for the unit step function. Hence one obtains the result 

/(O = M_x - I) [494] 

which is recognized without difficulty to meet the definitions 360. 

It should be observed that the question of how to close the gap in the 
path of integration at infinity is, in this example, resolved only after the 
integrand is separated into two terms, for the proper resolution with 
regard to one of these terms is different from that in the other. Unless 
the integrand is separated into two terms, it is obvious that the method of 
contour integration cannot be carried out for lack of an appropriate 
method of closing the path of integration. The method of contour integra- 
tion can be applied to other ^(X) -functions having essential singularities 
at infinity only if similar artifices can be de'/ised for dealing with this 
question. 

Regarding the possibility of encountering transcendental g(X)-fimctions 
having an infinite number of poles in the finite X-plane, one may consider 
the example in which is sought the current response to a unit step voltage 
at the driving point of a lossless open-circuited transmission line. Except 
for a constant multiplier, the g(X)-function in this case is of th^ form 

g(X) = [495] 

which has simple poles at the points 

X.= ±-'-y (V = 1,3,5, • ••) 

and an essential singularity at iqfinity. 


[496] 
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The question of how to close the gap in the path of integration at 
infinity is disposed of through observing that g{\) remains bounded for 
large values of X in the right or left half plane (which is not the case with 
the function 491 of the previous example). Hence large semicircles like 
the ones shown in Fig. 43 can be found on which the contributions to the 
contour integral for / < 0 and / > 0 are negligible. This question may, 
therefore, be resolved in a manner similar to that discussed for rational 
^(X)-functions. 

It remains to determine how one shall deal with the infinite number of 
poles of g(X) inasmuch as any semicircular path, no matter what its 
finite radius may be, cannot enclose all these poles. The difficulty pre- 
sented by the fact that these poles lie upon the path of integration is, 
incidentally, overcome through avoiding them by means of small detours, 
after the fashion shown in Fig. 44. This procedure is valid since, for a 
transmission line with some loss, however small, the corresponding poles 
lie in the left half of the X-plane. 

The residues of the integrand in the integral 474 with the g(X)-function 
of Eq. 495 are found to be given by 

_ sw^ = ± — . [ 497 ] 

X,, virj 

The significant point about this result is that the residues vary inversely 
as Hence they become smaller and smaller for poles which lie more 
and more remote from the origin of the X-plane. The terms in the corre- 
sponding time function, therefore, become negligibly small for very 
remote poles. Since any number of terms in the desired time function 
are readily calculated, the question of choosing a sufficiently large radius 
R for the semicircular paths of Fig. 43 evidently depends upon the 
degree of approximation to which the result should be determined. The 
contour integral yields the time function in the form of an infinite series, 
which incidentally is recognized as being the Fourier series for a square 
wave. 

By making use of the statement 392, and observing that multiplying 
g(X) by 1/X“ has the effect of multiplying the corresponding residues by 
1/Xv", one obtains a much more rapidly convergent series, but this effect 
is canceled by the subsequent «-fold differentiation. In practical prob- 
lems in which the form of the resulting time function is not recognized 
from inspection of its series representation so easily as it is in the present 
example, a reasonable number of terms usually suffice for a sufficiently 
good approximation provided the series converges. The question of the 
convergence of the series can always be examined by means of the expres- 
sion obtained for the residues. 

In some practical problems one may also encounter g(X)-f unctions 
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which are multivalued. For example, the determination of the response 
of an artificial transmission line to an applied unit step voltage leads to 
the transform 


g^) = 


1W\^ + + X)" 


[498] 


in which n (the nxunber of line sections) is an integer. The inverse trans- 
form is given by the integral 






d\ 


iTcjJ-i* + \Y 

The first step in the process of simplifying this integral is to let 


X 

- = 2e; 
a 


which converts 499 into the form 

= sX; 


^wat 


^ Vw^ + + 1 + wY 

Next one introduces the change of variable indicated by 

whence 


■dw 


[499] 


[500] 


[SOI] 


[502] 


[503] 


‘and, from the addition or subtraction of these two equations, one finds 


z y/w^ + i + .0 

[504] 

and 


i = y/w^ + 1 — w 
z 

[505] 

Forming the differential of Eq. 504 yields 


y/w^ + \ + w 
az = T— : dw 

[506] 

y/w^ 1 


whence, using Eq. 504 again, one has 

dz _ dw 


[ 507 ] 
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By means of Eqs. 502, 504, and 507, the integral 501 assumes the greatly 
simplified form 

I p Az-l. i) al.2 

in which the contour C in the ;3-plane must be chosen to correspond to 
the contour in the i£'-plane which is implied in the evaluation of the inte- 
gral 501 by the method of complex integration. 

Because of the multivalued character of the integrand in Eq. 501, 
the determination of an appropriate closed contour must be considered 
with some care. First it is obser\"ed that for large values of w the trans- 
form g{w) varies as and hence, even for n = 0, it fulfills the 

requirements stated earlier regarding the method of closing the gap in 
the path of integration at the point at infinity. Hence one may again 
contemplate closing this gap in the manner showm in Fig. 43 for / < 0 
and / > 0 without affecting the value of the integral. 

Next it is seen that for integer values of w, the Riemann surface for 
the integrand of Eq. 501 has two leaves. The values w = doj are branch 
points, and the portion of the imaginary axis between these is regarded 
as a branch cut. Since the point at infinity is not a branch point, the 
two leaves of the Riemann surface remain separate in this vicinity, and 
the path of integration around this point, therefore, remains on one of 
these leaves. 

The branch points, which also are simple poles of the integrand (see 
the discussion immediately following Eq. 256 in Art. 18 of Ch. VI), must 
be avoided in the manner already described by means of small semi- 
circular detours in the right half plane. In so doing, the path of integra- 
tion also remains on the same leaf of the Riemann surface (as discussed 
in greater detail below) and the condition that the contour be closed is 
fulfilled. 

In order to determine the contour C in the z-plane for the integral 508, 
it is necessary to consider in greater detail the substitution 502 and its 
inverse 504. For this purpose it is effective to determine the conformal 
map in the ?«;-plane corresponding to orthogonal families of concentric 
circles and radial lines symmetrical with respect to the origin in the 
2 -plane. With 

2 = [509] 

the loci in the 2 -plane are defined by r = const and 0 = const. Substitut- 
ing into Eq. 502, one has 

f t ^ 

V) — u jv — - (cos ij) +j sin 0) — — (cos ^ / sin <t>) [510] 

2 2r 
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whence 


cos^ 




[511] 


Eliminating <f> on the one hand and r on the other, one obtains respectively 

C .2 ..2 


and 


K'-]) 


= 1 




sin^ 0 cos^ <l> 


= 1 


[512] 


[513] 



Fig. 45. Conformal representation of the substitution given by Eqs. 502 and 504. 


For various values of r, Eq. 512 represents a family of confocal ellipses, 
with foci at the points u = Oy v = ±:j (that is, w = db^). These loci are 
shown in Fig. 45. For r = 1, the ellipse degenerates into the doubly 
traversed portion of the imaginary axis between the points w = zty. 
The same ellipse is evidently obtained for reciprocal values of r, a very 
large or a very small value of r yielding a large ellipse with very little 
eccentricity, that is, one which is very nearly a circle concentric with the 
origin. One observes that the interior of the unit circle in the 2 -plane 
(r < 1) is mapped upon the entire zc^-plane and that the exterior of the 
unit circle in the 2 -plane (r > 1) is also mapped upon the entire w^-plane. 
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The multivalued character of the function 504 is thus evident, inasmuch 
as two Xc'-planes (the two leaves of a Riemann surface) are needed to 
map uniquely all points in the s-plane. The two leaves of the Riemann 
surface in the Tt^-plane represent regions upon which the interior and 
the exterior of the unit circle in the 2 -plane are mapped respectively. 
The boundary between these two regions in the 2 -plane is the unit 
circle r = 1 ; in the i(;-plane it is the degenerate ellipse. The latter is the 
branch cut in the xi:?-plane through which one passes from one leaf of the 
Riemann surface to the other. 

For various values of <^, Eq. 513 represents a pair of families of confocal 
hyperbolas, which are images of each other about the real axis. These 
loci, which are also showm in Fig. 45, are orthogonal to the ellipses 
defined by Eq. 512, and their foci also lie at the points w = ±j\ The 
asymptotes of any hyperbola on one of the leaves 
of the Riemann surface njake angles with the 
positive real axis which are equal to the values 
of 4> appropriate to the corresponding branches 
of that hyperbola. On the other leaf, the angle 
between an asymptote and the positive real 
axis jis IT — 0. Suppose the top leaf to be that 
one on which the angle is <t>, and the lower one 
that on which the angle is tt — </>. Then if, for 
example, the branch of the hyperbola in the top 
leaf for 0 = 30® (this is in the first quadrant) 
is traversed in the direction toward the branch 
Fig. 46. Path in the ^.^t, one finds, after following this hyperbola 
z^plane corre^nding to the branch cut (into the second quad- 

according to the substitu- rant) that one is now on the lower leaf but still 
tion shown in Fig. 45. on an hyperbola to which 0 = 30® is appropri- 
ate. If, instead of passing through the lt)ranch 
cut, one imagines jumping over it so as to remain on the top leaf, one 
finds oneself on an hyperbola (in the second quadrant) to which 
0 = 180® — 30® = 150® is appropriate. 

With these properties of the substitution 502 in mind, it is easily 
appreciated that tie path in the 2 -plane corresponding to the imaginary 
axis in the 7£;-plane, traversed from —j qo to -hj « (avoiding the points 
w = ity by remaining slightly to the right of them), has the form shown 
in Fig. 46. The process of closing the path in the 2 «;-plane by means of a 
large semicircle in the manner shown in Fig. 43 corresponds to closing the 
pati of Fig. 46 in the 2 -plane in the same manner.* The large semi- 

*It Is true, of course, that a large drcle concentric with the origin in the 2 &*plane is not 
exactly (although very nearly) also a circle in the z-plane. This fact is, however, unimportant 
inasmuch as the argument regarding the closure of the path of integration does not require 
that the path increment in question be circular in form. 
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circles for / < 0 and t> 0 also lie in the right and left half planes respec- 
tively. The contour C in the integral 508 is thus determined. 

For / < 0 the value of this integral is evidently zero, for the integrand 
has singularities only at the points z = 0 and 2 = 00 . For / > 0 the con- 
tour C encloses the one singularity at 2 = 0. This contour may, therefore, 
be given any other form, as long as it encloses the origin in the 2 -plane. 
Choosing the unit circle for the path C and writing in conformance with 
Eq. 509, 



z = e’* 

[514] 

one has 

dz . , 

-=jd4> 

[515] 

and 


[516] 

so that the integral 508 becomes 




[517] 


This result is seen to be identical in form with the integral representation 
for the Bessel function, as given by Eq. 307. Hence one has for / > 0, 

j(sxi) = /n(a0 [518] 

It may be of interest to observe that the integral 508 alternatively 
represents the coefficients Jn hi a Laurent expansion of the function 
^(*-i/*)a //2 about its essential singularity at 2 = 0. One recognizes this 
.fact from a comparison of Eq. 166 of Ch. VI with the integral 508, 
remembering that C is a contour enclosing the origin in the 2 -plane. As 
discussed in Art. 7, the substitution 514 converts the Laurent expansion 
into a complex Fourier series. One thus obtains the Fourier series rep- 
resentation for the function 303 dealt with in Art. 16. 

An additional interesting feature about the transformation from the 
integral 501 to the integral 508 by means of the substitution 502 deserves 
special mention. In examining the integrand in the integral 501 one 
observes (as pointed out above) that the points w = ±j n ot only are 
branch points but also are simple poles, since the factor + 1 be- 
comes zero there. As the corresponding points in the 2 -plane, which from 
Eq. 504 are seen to be z = ±j, the integrand in the equivalent integral 
508, however, does not possess singularities. The reason for this peculi- 
arity lies in the fact that the function w{z) represented by the substitu- 
tion 502 possesses saddle points (see Arts. 14 and 18, Ch. VI) at z = ±j. 
This fact becomes evident upon the forming of the derivative of Eq. 502, 



562 


FOURIER SERIES AND INTEGRALS 


[Ch. vn 


which reads 



This derivative has simple zeros for z = ±7. Since the second derivative 
does not vanish at these points, they are saddle points of the first order. 
The vanishing of dw/dz at the points s = ±jy corresponding to w = dtzj\ 
may also be seen from Eq. 507. Since the quantity dz/z = dw/Vw^ -f 1 
remains regular in these points, the integral 508 does so likewise. For this 
reason it is unnecessary that the contour C of Fig. 46 be modified so as to 
avoid the points z — ±7; the path of integration in the c-plane may pass 
through these saddle points. 

It may also be of interest to recognize that the integral 508, except 
for the factor 34, is equivalent to Sommerfeld s integral 306. In place of 
the relation 514, one uses the substitution 

c = [520] 

in which r is regarded as a new complex variable. Writing p for at and 
dropping the factor 34> one has the integral 

Zn{p) = - f [521] 

ttJl 

This is simply an alternative form for Sommerfeld ’s integral as giv^en by 
Eq. 306. The latter form is obtained from Eq. 521 by making the addi- 
tional change of variable 

f = ^ - r [522] 

The minus sign resulting from the fact that d^ = —dr is unimportant 

inasmuch as it may evidently be canceled through traversing in opposite 
direction the path of integration, which as yet is not specified. 

For the various kinds of cylinder functions* defined by the integral 
521, the path of integration L begins and ends at infinity. In order to 
insure the convergence of the integral, it is necessary (assuming p > 0) 
that the portions of L which extend toward infinity do so within regions 
of the r-plane in which the real part of 7 sin t remains negative. Letting 
r = ^ +jrit 

j sin (0 + 717) = — cos <l) sinh 17+7 sin </> cosh rj [523] 

*For the demonstration showing that the function 521 formally satisfies Bessel’s equation 
(which is dissociated from the present discussion), the reader is referred to the literature on 
this subject, for example, R. Courant and D. Hilbert, Methoden der mathematischen Physik, 
I (Julius Springer, 1924), 382, or E. T, Copson, Theory of Functions of a Complex Variable 
(Oxford, 1935), 313. 
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whence it is readily recognized that the regions in which the real part is 
negative are those shown cross-hatched in Fig. 47. Paths such as those 
labeled Li and L 2 may, therefore, be regarded as closing upon them- 
selves at infinity, so that the principles of contour integration become 
applicable. 

The detailed form of such a path within a cross-hatched region may, 
therefore, be modified at will without affecting the value of the integral. 
Thus the path L\ may alternatively be assumed to lie along the imagi- 



Fig. 47. Appropriate paths of integration Fig. 48. Paths in the z-plane 
passing through saddle points, used in the corresponding to modified ver- 

approximate evaluation of Sommerfeld^s sions of L\ and hi in the r-plane 

integral. of Fig. 47. 


nary axis from 1; = « to 77 = 0, thence to lie along the real axis from 
0 = 0 to 0 = TT, and from there to proceed toward 77 = — » along the 
vertical line <t> == t. Similarly, the path may be assumed to lie along 
the line </> = — tt from 77 = — <» to 77 = 0,' thence to lie along the real 
axis from </> = — tt to </> = 0, and finally to proceed toward 77 = 00 along 
the imaginary axis. 

The paths Ci and C2 in the z-plane, corresponding respectively to these 
modified versions of the paths L\ and L 2 in the r-plane, are shown in 
Fig. 48 (with due allowance for slight departures necessitated by drawing 
both paths in the same figure). 

The specific functions defined by the integral 521 for the paths Lx and 
L 2 are referred to as the Hankel functions or also as Bessel functions of 
the third kind. These are 

= - f 


[524] 
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and 

= - f [S25] 

X *lLi 

For real as well as complex values of p, these functions are complex. 
The conjugate value of Hn^^\p) for real values of p is given by 

(p) = - [ ““ dr [526] 

in which every point on the path Li is the conjugate of a corresponding 
point on Li, that is, Li is the image of Lj with respect to the real axis. 
If r, the variable of integration, is replaced by — r, every point on the 
path of integration is replaced by its negative; that is, the path of integra- 
tion becomes replaced by its image about the origin. If the latter path is 
denoted by — Li, one may write in place of Eq. 526 

(p) = ^ f <i( - t) [527] 

It is now observed that the path — Li is the image of Lj about the 
imaginary axis. Reference to Fig. 47, therefore, shows that —Zi is 
identical with the path L 2 except for a reversal of the direction in which 
it is traversed. Changing this direction merely reverses the algebraic 
sign of the result. Hence one has 

= - f dr = [528] 

XvLj 

that is, for real values of p, the Hankel functions of the first and second 
kind are conjugate complex. 

The sum of these two functions may be expressed by a single integral 
of the form 521 in which the path L is the resultant of the paths Li and Zr 2 . 
Reference to Fig. 48 shows that the corresponding resultant path in the 
2 -plane‘ is the unit circle enclosing the origin. According to the preceding 
discussion this path yields the Bessel function 7n(p), hence 

7n(p) = [529] 

For real values of p, one may regard the Bessel function as the real part 
of either of the Hankel functions. The relation 529, however, is by defini- 
tion assumed to hold for complex as well as for real values of p. 

For the sake of completing the present picture, it may be mentioned 
that the so-called cylinder functions of the second kind (also called 
Neumann functions) are given in terms of the Hankel fimctions by a rela- 
tion complementary to Eq. 529, namely. 


[ 530 ] 
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One observes that the Hankel functions are analogous to the exponential 
functions and the Bessel and Neumann functions to cos x and 
sin X respectively. 

Asymptotic expressions valid for large values of the argiunent p may 
be obtained through evaluating the integrals 524 and 525 by the so-called 
‘‘ saddle-point ” method. In terms of the variable r defined by the sub- 
stitution 520, the function 502 reads 

w = j sinr = u{4>,r)) +72^(0,17) [531] 

The saddle points which, in the 2-plane, occur for z = are located 
in the r-plane at the pxDints r = :izTr/2, Reference to Fig. 47 shows that 
the paths Li and L 2 pass through these points. There the value of u is 
zero, whereas on either side of a saddle point u is negative. The exponen- 
tial function 

gjpsinr ^ ^(u+/®) [-532] 

appearing in the integrals 524 and 525, has the magnitude 

|gip8inr| ^ g J 1-533] 

the value unity obtaining at a saddle point. 

If one is mindful of the general character of the contours in the r-plane 
defined by w = constant and v = constant, as discussed in Art. 14, 
Ch. VI (in particular, see Fig. 10 of Ch. VI for s = 2), one observes 
that if the path of integration is chosen to coincide with that contour 

V = constant which passes through the saddle point, the function u, and 
hence the exponential function given by Eq. 533, experience their most 
rapid rate of growth and subsequent decay. This fact is readily appre- 
ciated if one utilizes the analogy of regarding the loci u = constant as 

. being contour lines in a mountainous terrain, and the orthogonal loci 

V = constant as indicating the direction of the gradient (direction of 
steepest ascent) in this terrain. The saddle point has the character of 
a mountain “ pass,” and the contour v = constant which passes through 
the saddle point represents the shortest route along which one may scale 
the height of the “ pass ” and descend into the valley beyond it. Clearly 
then, if this route is chosen as the path of integration, the values of the 
exponential function 533 pass continuously and most rapidly from the 
negligibly small magnitudes which obtain for points within the shaded 
regions of Fig. 47 remote from the origin, through their maximum (which 
occurs at the intersection of the path with the real axis), and back to 
negligibly small magnitudes again. 

It is also readily appreciated that the portion of the path (in the 
vicinity of a saddle point) throughout which the exponential function 
has appreciable values becomes shorter as p becomes larger. For large 
values of p, therefore, the principal contribution to the value of either 
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of the integrals 524 or 525 is furnished by a rather short path increment 
in the immediate vicinity of the respective saddle point. An integration 
over such a short path increment alone then yields substantially the 
correct value of the desired function, the approximation becoming 
asymptotically better and better the larger the value of p. 

In order to be able to carry out such an integration, one must first 
determine the direction (in the r-plane) of the contour v = constant 
passing through the saddle point. From Eqs. 523 and 531 one has 

w = — cos sinh rj 


and 


[534] 


V = sin 4> cosh rj 


At the saddle points r = ±ir/2 (that is, <t> = dzir/2, rj = 0), so that, as 
pointed out above, w = 0 and 


= 1^0 = j^sin <5 cosh = 


ztl 


Along a contour v == constant one has 

dv = 0 = cos (t> cosh rjd<f> + sin </> sinh rj drj 

from which 

d<t> 

— = — tan <t> tanh rj 
drj 


[535] 

[536] 

[537] 


By use of Eq. 535 the values of this derivative at the saddle points are 
foxmd to be 

^ “ sin 0 = =f1 [538] 

arj 

in which the minus sign applies to the saddle point at r = +t/2 and 
the plus sign to the one at r = — ir/2. Hence the path Li, which passes 
through the saddle point at t = ir/l, should -do so at an angle of —45 
degrees with respect to the real axis, whereas the path L 2 should pass 
through its saddle point at r = — t/ 2 at an angle of +45 degrees. The 
paths shown in Fig. 47 comply with these conditions. 

Next it is necessary to determine the detailed behavior of the function 
«( 0 ,i 7 ) in the immediate vicinity of the saddle points. This behavior 
must be expressed in terms of a variable which represents length measured 
along the respective paths of integration through these points. If this 
variable is denoted by 5 , the above determination of the path increments 
through the saddle points shows that one may write for the immediate 
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vicinities of these points 

T = ± ^ [539] 


whence 


ze; = 7 sin r = ±y cos [540] 

Here s is regarded as a small quantity, so that one has approximately 

ws ±7^1 - y = ±i(i *-/y) = “ y 

and, therefore, 



[542] 


The value of v, according to the result 541, of course, agrees with that 
expressed by Eq. 535. The exponential factor 532 finally becomes 

giPBin / ^ fQj. 1^1 ^ 1 |-543] 

in which the plus sign in the exponent is to be used in the integral 524 
and the minus sign in the integral 525. 

The exponential factor which also appears in these integrals, 

may, for the short path increment over which the integration is ex- 
tended, be regarded as slowly variable compared with the factor 543, and 
hence as replaceable by its values at the saddle points. These are 

^^:jnrl2^ 

Observing, according to Eq. 539, that 

dr = ds [544] 


‘one then has for the desired asymptotic expressions for the integrals 
524 and 525 


and 


(p) ^ jr^‘ c-p»’/2 

T V— « 

(P) S - ds 

IT V— e 


[545] 

[546] 


in which It is the length of the small path increment over which the 
integration extends. If the integrand is plotted versus s for large 

values of p, one finds that the area under this curve is confined sub- 
stantially to the immediate vicinity of the origin (s = 0). In other words, 
the total area under this curve (which is obtained through integrating 
from j=— ootoj=<») differs from the area within a small region in 
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the vicinity of the origin by an amount which becomes smaller and smaller 
as p becomes larger and larger. The total area is given by 

[547] 

Hence one has for the desired asymptotic expressions for the Hankel 
functions 

(p) S . P • 1-548] 

\ irp 

and 

(p) ^ J- ■ [549] 

\ Jrp 

According to Eq. 529, a corresponding asymptotic form, valid for 
large p, is obtained for the Bessel function 

7n(p) = cos ^P - [550] 


For n = 0 and n = 1 this result yields the formulas given by Eqs. 322 
and 323 in Art. 16. 

It may be well to point out that the approximate expressions just 
derived do not yield accurate results if the parameter n as well as the 
argument p is large, that is, if n and p are of the same order of magnitude. 
The truth of this statement is readily seen from the fact that if n is also 
large, the factor appearing in the integrals 524 and 525, may no 
longer be regarded as essentially constant throughout the path increment 
over which the integration extends. 

For a more general treatment of the present problem, one begins by 
setting 

p = an [551] 

and rewriting the integrals 524 and 525 in the forms 


and 


= - 1 





f dr 

% 


[552] 

[553] 


In considering the saddle-point method of integration one then lets 

w -j{a sin T — t) = n +jv [554] 
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The saddle points are those values of t for which 


yielding 


dw 

— = j(a cos T — 1 ) = 0 
dr 


cos T = 


1 

a 


[555] 

[556] 


For large values of a (that is, for p » w), Eq. 556 yields very nearly 

cos T = 0 or T = ztTT [557] 

which are the saddle points of the function 7 sin r considered above. 

The complete treatment of this problem (a being assumed real) re- 
quires separate consideration of the cases a > I, a = 1 , and a < 1. One 
obtains, in this manner, representations for the cylinder functions in the 
form of semiconvergent series of which the approximate results derived 
above are the first terms. The further detailed discussion becomes too 
specialized to be included under the heading of the present article,* in 
which the primary objective is to consider the essential principles in- 
volved in the use of complex integration for the evaluation of inverse 
Fourier transforms. 


PROBLEMS 

1. Make sketches of periodic functions which have the following specific char- 
acteristics: 

(a) The Fourier series contains only sine terms but all harmonics are present. 

(b) The Fourier series contains only sine terms and only odd harmonics. 

(c) The Fourier series contains only cosine terms but all harmonics are present. 

(d) The Fourier series contains only cosine terms and only odd harmonics. 

(e) The Fourier series contains sines and cosines but only odd harmonics. 

‘ (f) The Fourier series has the property that the odd harmonics are sines and the 

even ones are cosines. 

(g) The Fourier series has the property that the odd harmonics are cosines and the 
even ones are sines. 

(h) The Fourier series has only even harmonics. 

(i) The Fourier series has harmonics of order a, 2a, 3a, •••, a being any fixed 
integer. 

2. Given a set of fimctions 4fk(0 for ^ = 1, 2, • • • n, with the following properties. 
0i(/) is periodic, having the period r. 

MO “ - h), <h(0 = ~ • • ^MO * - ^n-i) 

in which h, < 2 , * • * ^1 are any finite quantities. Let the svun of these functions be 
denoted by 4>(0> 1*^*1 

0(0 * 

♦ThU material may be found, for example, in Courant-Hilbert, op, cU,, pp. 436'440,or 
E. T. Copson, op, cU. pp. 330-336. 
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If the function has a Fourier series representation with the cosine and sine 
coefficients Gv and bvy respectively, show that the corresponding coefficients for the 
resultant periodic function 0(/) are given by the expressions, 

Ao - nao 

n n 

Av — Gv 2 COS vu)tk — bu ^ sin vojtk 

k=l = l 

n ^ n 

Bt, = by ^ cos vcotk + Uv ^ sin pcotk 

k=l 

Hint. Write the Fourier series for <f>k{t) and in exponential form first and then 

convert to the trigonometric forms. 

3. Suppose, for the set of functions defined in the preceding problem, one chooses 
tk = k(T/n). The periodic functions <^i, <f> 2 t ' "<t>n then form a cyclic group. Show 
that their sum </>(/), when it neither vanishes nor reduces to a constant, represents a 
periodic function with the period t/m. Show that the same is true of the function 
E(t) *= 01 X 02 X • • * X 0n given by the product of the functions forming the 
cyclic group. 

4. For the sum function 0(/) of the previous problem show that its Fourier coeffi- 

cients A^ and are given in terms of and by for the component functions by the 
simple relationships: A^ = UGuy By, = for /u = 0, 1, 2, • • • and v = Correlate 

this result with the formulas given in Prob. 2. 

5. As an application of the principles illustrated by the previous problems, let 

0i(/) = sin for 0 < / < - 
w 

ioi-<t< — 

0) w 

and consider the cyclic groups for n = 2, 3, 6, 12, representing wave forms resulting 
from polyphase rectification. Compute the coefficients Ay, and By according to the 
formulas of the previous problem and check, through carrying out the usual integra- 
tion. 

6. Square pulses of amplitude A and duration 5 = t/10 characterize the periodic 
fimction 

^i(<) = A for 0 < « < ^ 

^i(/) =0 for^<t<T 

Find the spectrum of this function and compare it with those obtained from a cyclic 
group for « =2 and » = 3 according to the principles given in the previous problems. 

7. In applying the formulas in Art. 15 to the numerical evaluation of Fourier 
coefficients for graphically given functions, a simplifying expedient is to alter the 
niunber of intervals n according to the order of the Fourier coefficient being calculated 
instead of using the same fixed number of intervals for the calculation of all coefficients. 
For example, if, in the use of formulas 299 to 302, one chooses n ~ r, all the Ukr and 
^kr become =bl or zero, and the resulting computations are correspondingly simplified. 
This procedure, known as the Fischer-Hinnen method, must evidently be applied 
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with care. The results are not good for the fundamental and lower harmonics but 
improve with increasing order. 

Try this method out on a function j{x) whose values for x ” ^ ^ ^ ‘ ^ 

respectively 0.0, 4.0, 7.0, 8.9, 9.7, 10, 9.5, 9.0, 8.7, 8.65, assuming that /(— x) = — /(x) 
and /(x -l-7r) = — /(x). Compute harmonics through the seventh. Compare with 
results obtained from computations that do not utilize this simplified approach. Plot 
both results and compare with a plot of the given function. 

8. For increments in x equal to 7r/12, starting with x = 0, the values of a given 
function /(x) are: 

0, 4.2, 8.9, 9.9, 9.5, 8.9, 8.5, 8.9, 9.5, 9.9, 8.9, 4.2, 0, 

1.5, 1.8, 1.9, 2.1, 2.5, 3.0, 2.5, 2.1, 1.9, 1.8, 1.5,0 


Choosing « = 12, use the formulas 299 to 302 to compute the harmonics through 
the 11th. Plot the resulting partial sum and compare with the given function. 

9. Given the function /(x) = /(x + 1‘Kk') defined by 


X 

a 



2t — X 
2t — a 


for 0 < X < a 
for a < X < 27r — a 
for 27r — a < X < 27r 


Show that for this function On = 0 and 

^ 2 sin na 

o(7r - a)2 


From this result determine the Fourier series for the following functions having the 
same period: 


/i(») 


h{x) = 


fix) = 


_5r — ac 

for 0 < X < 27r 

V 


for 

0 <x <- 



2 

to 

1 

for 

TT StT 

-Z <x <-;r 

«• 


2 2 

2(2ir - *) 

for 

— < X < 2ir 

3ir 


2 

X 

for 

0 < X <7r 

IT 

1 

1 

for 

TT < X < 27r 


TT 


Mairp sketches of all the functions. 
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10. Find the Fourier expansion for a periodic function defined by 


for 0 < / < 


.(.<0 


f{t) * sin TT - for a < / < r — a 

T 


for r — a < / < r 


Make a plot of the function and find the form of the Fourier series corresponding to 
a = 0. Find the sum of the series for / - a and t — t — a. 

11. A function /(/) is defined by 


0 



0 


for 0 < / < ^ — a 
for^-o</<^+o 
for ^ + a < f < T 


Determine the Fourier series expansion. Plot the function for a - t/ 5 as well as the 
partial sums 5i, 52, * * * 55, thus showing the manner in which the given function is 
approximated. Plot the spectrum function for the same value of a. 

12. A periodic function consists of a regular succession of identical pulses of short 

duration (similar to the function of Prob. 11 for a/r 1) the area under each pulse 
being A, Show that the values of the constant term and those of the fundamental and 
low'er harmonic amplitudes are very nearly independent of the detailed pulse shape 
(whether rectangular, triangular, sinusoidal, etc.), being proportional only to the 
pulse area. Deduce the pertinent relationships. As the duration of the pulse is assumed 
to become shorter and shorter, the area remaining the same (— ^4), show that the 
Fourier coefficients are ultimately given by ao = /I/t, * 2A/t, independent of n. 

13. Check the expansions of the following functions for the interval 0 ^ ac ^ t. 
Plot the functions and several terms of the series, noting rapidity of convergence. 


-x{r - ac) * 2; 

O n-l,3,S- 


sin nx 


^ cosfiac 

— (tt - 2x)(x* + 2xx - 2x*) = — 

96 n* 

14. Consider /(x) - — x) sin x within the interval 0 ^ x ^ t, and check the 

following series representation: 

- + 7 cos X — — cos 2x — — cos 3x — — cos 4x — • • • 

2 4 1*3 2*4 3-5 


15. By using the Cauchy principle of convergence, show that th^ series 


S 


^ sip ^ 

n-l n 




converges uniformly except at the points x - 0, 2x, 4ir, • • • . 
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16. Determine the regions of uniform convergence of the series 


5 


«e 

L 


n »0 


COS nx 
n 


and define the points at which it diverges. 

17. Show that the series 


2^ s" cos nz and 2 si® 

0 

converge absolutely and uniformly inside a circle of unit radius. 

18. Show that the series 


^ dnZ^ cos nz and sin nz 

0 

converge absolutely and uniformly inside a circle of radius 

R = limit 

n — ► « 

19. Discuss the convergence of the series 

30 

5 = UnX sin nx 

n =0 

in which a,* > Un+i > a»-f 2 • * • and Un — ^ 0 for « -► 00 . Does the point x = 0 belong 
to the region of uniform convergence? 

20. Show that the series 

sin 2jc sin 3x sin nx 

sm*+— +— + •••+ — +... 


converges uniformly except at points x - 0, db27r, ±4x, • 

21. Show thatrthe series 


O — ^ 

n»l,3,5. ••• 

converges uniformly to the function (tt ~ 2jc)7r/8 in the interval 0 ^ jc ^ tt and to 
the function (x — 37r/2)7r/4 in the interval t ^ x :-i 2x. 

22. Show that the expansion 


K?-) 


1 o 1 , 1 . 

cos X - cos 2x + cos 3x - 75 cos 4x H 

2 * 3 ^ 4 ^ 


converges imiformly in the interval -ir ^ x ^ x. Through an obvious change of 
variable obtain the series 

, c* 4c*r Try 1 2xy 1 3xy 1 

y. . _ _ _ l^cos- - ^cos— +^cos— - + . • -J 

and state its interval of uniform convergence. 

23. Using the results of Prob. 22, show that 


12 2*^3* 4*^ 
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and 


T* 111 

42 "^' 


24. Using the results of Prob. 22, show that the series 

c 4r r 7r:c , I 37r.v , 1 Swx 1 


converges uniformly within the interval 0 ^ .r ^ r, and obtain the result 

7r2 1 1 1 

25 Through the use of a Laurent expansion obtain the series 
1 — r cos 2 


1 — 2r cos 2 4- 


= I -f- r cos 2 4- r- cos 22 4- r® cos 3z 4- • • • 


in which r is real but 2 may be complex. Show that the region of convergence is defined 
as —1 < r < 1. 

26. Find the continuous spectrum for the function 

fO for / <0 

m = \ 

[Ae sin 0 ) 0 ^ for 0 ^ < 00 

Plot the result for A = 10, wo = 2ir X 10®, a = 2 X 10^. 

27. Find the Fourier transforms of the following functions; 


(a) 


(b) 


fit) - 0 for / < 0 
fit) at lQr0<t<h 

fit) = ati - bit - h) for < / < /2 * /i 4- 1') 

fit) =0 for /2 < / < 00 

fit) =0 for / < /i > 0 

/(/) = for/i</<t2 

/(/) =0 for ^2 < ^ < ®o 


28. Let G(w) = lirgio)) and introduce Fit) X fit) in which <r is a real quantity. 
Then letting 5 = <7 4“i«, obtain from the Fourier transforms the Laplace transforms 


Gis) 


1 ^«^4i • 

2 irj — j 00 

-x: 


Gis)e‘^ds 


dt 


Show that the condition for the existence of the transform G($) may be expressed by 
stating that the integral 


J* \€~^^Fif)\ dt remains bounded 
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Thus show that if 1^(0 1 < in which C and c are positive real constants, the 
transform G{s) exists so long as a > c. (The quantity c is, therefore, called the abscissa 
of uniform convergence.) 

29. Find the abscissa of uniform convergence for the following functions 

1 1 ^ 1 1 1 1 1 

s' s-\a' s - a' (s a)(s - b)' {s + a)^ ' {s - a)^ ' 

5 ^ 1 1 

(5 4-a)(5— a)^ (\/^ 4 - V j a^/s — b s — a 

30. With the interpretation given in Prob. 28, and assuming F{t) = 0 for / < 0, 
show that the transforms of the following functions: 

a, sin a/, cos a/, sinh aty cosh aty e“®* sinh bt, 

cosh bt, ty t sin aty t cos a/ (in w'hich a and h 

are positive real constants) 

do exist; and by direct integration obtain for G(5) respectively: 

a 1 a s a s b 

s' s a' 5* + a- * 5^ + a- * {s a)^ — b^' 

5 4- 1 w ! las s^ — 

(i +a)=-6*’ (s^+a^y' (j® + ay 

31- Through contour integration find F(t) corresponding to 

“ (s - a)(s - b)(s - c) 

in which a, b, c are real (unequal) quantities. Without further direct integration, 
what are the time functions corresponding to the following transforms: 

£ ^ ^ 1 

(j -- a)(s — b)(s — c) * (j ~ a){s — b)(s — c) s(s — a ) (5 - b)(s - c) 

, 32. Starting with the pair of transforms 

F(i) =e®‘ G(s) = 

find through convolution the time functions corresponding to 

1 1 1 1 , 

(5 — a) 2 * (s — a )(5 — 6 ) * (5 - o)(^ + b) (s +a)(s + b)(s + c) 

^ 

(5 +a)(5-6)(5-c) 
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A 

Abel, 285 

Abridged quadratic form, 165 
Absolute convergence, 277 
circle, 283 
Adjoint, 42, 47 
Admittance function, 359 
Affine co-ordinates, 85 
scale of length in, 88 
transformation, 85 
Algebra, fundamental law of, 325 
Algebraic functions, 314, 318, 321 
singularities, 318 

Alternating component, of a periodic func- 
tion, 471 

Alternatives, rule of, 105 
Altitude function, 183, 195, 351 
Amplitude spectrum, 473 
Analytic continuation, 288, 549 
functions, 257 

Approximation, Bessel functions, 509, 568 
Fej6r polynomials, 496 
Fourier series, 483, 485 
Gibbs phenomenon, 495 
orthogonal functions, 442, 459, 501 
uniform tolerance for, 502 
Arc, differential, 198, 235 
Arithmetic mean sequence, 285 
Augmented matrix, 64 
Axes, principal, 137, 156, 159 
semi-, 140, 144 
co-ordinate {see Co-ordinates) 

Axial vector, 183 

B 

Beating, 513 

Bessel equation, 440, 562 
functions, 440, 506, 561 
approximation, 509, 568 
relation to Fourier series, 506 
Bierens de Haan, 345 
Bilinear form, 134 
transformation, 363 
Bipolar circle, 372 
Birkhoff, G., 142 
Bocher, M., 156 
Bode, H. W., 343, 346 


Bolzano-Weierstrass theorem, 276 
Bordered determinant, 11 
Borel, 285 

Boundary conditions, 207, 379, 440 
distributions, 441 
-value problems, 207, 379, 387 
Branch cut, 315 

integration across, 316, 560 
point, 296, 314 
dehnition, 314 

integration around, 310, 316, 560 
interpretation as a vortex, 351 
logarithmic, 319 

C 

Canonic form, matrix, 61 
quadratic form, 150, 154, 156, 159, 172 
Caratheodory, C., 1 

Cartesian co-ordinates, 85, 100, 137, 159, 184, 
188, 235 

Casorati-VVeierstrass, theorem of, 296 
Cauchy integral formula, 272 
integral law, 267 

principal value of an integral, 339 
principle of convergence, 277, 466 
residue theorem, 302 
-Riemann equations, 256 
theorem on the number of zeros in a region, 
407 

Cayley-Hamilton theorem, 118 
Central conics, 135 
Ces^ro sum of a series, 285, 496 
Chain, dipole, 358 

Characteristic determinant, 111, 141, 162 
equation of a matrix, 62, 111 
function of a matrix, 111 
matrix, 111, 141 

values of a matrix {ste Latent roots) 
Charge density, 204, 291, 349 
Checkerboard rule, in determinants, 7 
Christoffel, 378 
Circle, bipolar, 372 
convergence, 283, 288 
transformation of, by fractional transfor- 
mation, 366 

unit, 284, 361, 369, 417, 419 
Cluster point, 276 
Cofactor, definition, 6 
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Cofactor, minor, 6 

relation to direction cosines, 112, 162 
use in inverting a matrix, 41 
use in solving equations, 13 
Collineation, 117 {see also Linear transfor- 
mation) 

Collineatory transformation, 117, 137 
effect on latent roots, 1 18 
Column matrix, 32 
Comparison test, for series, 279 
Complex integration {see Integration) 
plane, 253, 258, 262 
simple, 365 
sphere, 262, 317, 376 
variables, 253 
conjugate value, 257 
definition, 253 

functions of {see Functions of a complex 
variable) 

polar representation, 253 
Condensation, point of, 276 
Condenser, 387 
Conditional convergence, 281 
Conditioned maxima, method of, 143, 211 
Conformal mapping, 258, 326, 330, 360, 378 
algebraic function, 315, 318 
at a zero or saddle point, 300 
circle, 363 
complex sphere, 263 
general function, 380 
inverse function, 326 
inversion, 331, 361 
linear fractional function, 363 
logarithm function, 310, 325 
polygon, 383 

polynomial functions, 397 
reciprocal function, 360 
reflection, 361 

Schwarz-Christoffel transformation, 378 
use of exponential function, 379, 391 
Congruent transformation, 136, 149, 157 
Conics, 135 

Conjugate diameters of an ellipse, 160 
matrix, 43 

potential functions, 333, 351 
Constraint, plane, 165 
vector, 166 

Constraints, linear, on a quadratic form, 165, 
172 

Continuant, 21 

Continuation, analytic, 288, 549 
Continued fraction expansion, 403 


Contour integration, 199, 204, 223, 264, 268, 
548 

functions of a complex variable {see 
Integration) 

functions of a real variable, evaluation, 
432, 548 
lines, 196 
map, 196 

Contragredient sets of variables, 95 
Contravariant components of a vector, 95, 
159 

Convergence, series {see Scries, convergence) 
uniform, 282, 465, 500 
abscissa of, 575 
Convolution, 530 
Co-ordinates, affine, 85 
scale of length in, 88 

Cartesian: 85, 100, 137, 159, 184, 188, 235 
cylindrical, 234, 238 
general, 28, 234, 439 
normal. 111 

orthogonal, 81, 85, 97, 137, 140, 165, 184, 
234, {see also Co-ordinates, Car- 
tesian) 
reciprocal, 90 

rectangular {see Co-ordinates, Cartesian) 
right- and left-hand, 84, 184 
spherical polar, 234, 239 
transformation of, effect on latent roots, 
118 

effect on quadratic form, 135, 165 
oblique systems, 85, 97 
orthogonal systems, 81, 166, 184, 234 
Copson, E. T., 562, 569 
Couple, 543 

Courant, R., 142, 562, 569 
Covariant components of a vector, 95 
Cramer's rule, 15 , 

Critical points, 382 
Cross product, 190 
Crout, P. D., 64 

Curl, in curvilinear co-ordinates, 243 
definition, 208 
determinantal form, 213 
divergence of, 218 
of the gradient, 213 
interpretation as a linear vector 
formation, 228 
surface, 217 
of the vector r, 234 
Curvilinear co-ordinates, 89, 234 
curl in, 243 
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Curvilinear co-ordinates, divergence in, 242 
gradient in, 240 

Cylinder functions, 440, 507, 562 
Cylindrical co-ordinates, 234, 238 

D 

d*Alembert ratio test, 278 
Dedekind test for convergence, 282 
Definiteness of quadratic forms, 144, 149,150, 
161, 171 

Degeneracy {see also Nullity, Rank) 
matrix, 61, 105, 109, 137 
degree of, 61 
quadratic form, 137 
Del, 197 

Dependence, linear, algebraic equations, 18, 
100 

vector fields, 186 
vector set, 77, 100, 114, 143 
Derivative, directional, 196 
of a function of a complex variable (see 
Functions of a complex variable, 
derivative) 
normal, 196 

of the singularity functions, 542, 546 
Determinant, bordered, 11 
characteristic, 111, 141, 162 
checkerboard rule, 7 
cofactor, 6, 40, 112, 162 
Cramer’s rule, 15 
definition, 1 
development of, 7 
evaluation, 3, 9 
fundamental properties, 1 
Gramian, 146, 151 
Laplace’s development, 7 
of a matrix, 31, 40, 137 
minor, 4, 19, 56 
principal, 5, 154, 406 
multiplication, 11 
order, 1 
product, 11 
rank, 18, 39, 61, 78 
symmetrical, 15 
triangular form, 7 
use n solving linear equations, 13 
% i'onskian, 25 

Determinantal form, curl, 213 
Hur^’itz criterion for polynomials, 405 
scalar triple product, 194 
vector product, 192 


Detour, semicircular, in contour integration, 
382, 549, 552, 560 

Diagonal form, reduction of a matrix to, 59, 
111, 137, 146, 152 

relation to quadratic form, 137, 143 
Dickson, L. E., 119 
Differential arc, 198, 235 
equations, Bessel’s, 440 
Cauchy-Riemann, 256 
Laplace’s, 207, 243, 266, 378, 439 
Legendre’s, 440 
Poisson’s, 207 
solution, 441 
wave, 440 
rectangle, 212, 269 
vector, 198, 201, 235 
volume, 202, 209, 241 

Differentiation, functions of a complex vari- 
able, 254, 275 
of a power series, 284 
of a vector, 225, 232 
Dipole, 216, 353 
chain, 353 
moment, 353 

Direction cosines, 82, 101, 112, 162 
parameters, 126 
Directional derivative, 196 
Dirichlet conditions, 463, 511, 520, 527 
Discriminant, 132, 137, 146, 154 
Distribution, boundar>', 441 
initial, 441 
source, 200, 206, 291 
Divergence, 200, 228, 264 
of the curl, 218 

in curvilinear co-ordinates, 242 
definiti->n, 201 
of the ‘gradient, 206, 220 
surface, 206 
of the vector r, 233 
Dominant series, 279 
Dot product, 188 
Double stratum, 357 
Doublet, 353 
unit, 542, 546 

E 

Effective value, 478 
Fourier transform of, 530 
Eigenwerte, 140 (see also Latent roots) 
Electric charge density, 204, 291, 349 
field, 184, 349, 387 



S80 


INDEX 


Electric charge density, field, intensity, 349 
networks, 158, 346, 359, 410, 416, 477, 481 
transmission line, 441, 452, 555 
Elementary functions, differentiability, 257 
transformation matrix, 54, 146 
Ellipsoidal surface, 138, 144, 149, 156, 165 
Elliptic transformation, 368 
Energy functions, 156, 184 
potential, 200, 222 
Entire functions, 297, 321 
Envelope functions, 515 
Equations, characteristic, of matrices, 62, 1 ] 1 
differential, Bessers, 440, 562 
Cauchy-Riemann, 256 
Laplace^s, 207, 243, 266, 378, 439 
Legendre’s, 440 
Poisson’s, 207 
solution, 441 
wave, 440 

linear (see Linear algebraic equations) 
Error, 483, 502 
function, 544 

Essential singularities, 296 
Euler, 285 

Even functions, 340, 341, 398, 412, 450 
harmonics, 454 
Expansion (see also Series) 
continued fraction, 403 
of functions in series, 286, 290, 305, 307, 
415, 441 
orthogonal, 442 
partial fraction, 307, 415 
Exponential functions, in conformal map- 
ping, 379, 391 
Fourier series, 460, 531 
orthogonality, 444 
singularities, 296 

F 

Fej6r polynomials, 496 
relation to Fourier series, 497 
uniform convergence of, 500 
Field, electric, 184, 349, 387 
force, 183, 200 
lamellar, 185 
map, 387, 391 

potential, 185, 200, 218, 221, 231, 234, 263, 
333,351 
scalar, 183 
source, 185 

source-free, 185, 207, 218, 265 


Field, vector (see Vector field) 
vortex, 186 

Filamental source distribution, 204 
Fischer-Hinnen method, 570 
Flow lines, 184, 267 

source of, 185, 203, 388, 392 
map, 185, 388 
Flux density, magnetic, 349 
Force field, 183, 200 
gravitational, 200 
vector, 198, 208 
work due to, 198 
Formula, Cauchy’s integral, 272 
Schwarz-Christoffel, 380 
summation, 479 
Fourier analysis, 501, 519 
graphical, 501 
integral, 51 f 

alternative forms, 522, 523 
derivation from Fourier series, 518, 
537 

elementary properties, 524 
Gibbs phenomenon, 522 
series, alternating component, 471 
approximation with, 483, 485, 501 
coefficients, 449, 460, 472 
complex, 460, 472 
convergence, 459, 464 
cosine, 450 
definition, 448 

Dirichlet conditions, 463, 511, 520, 527 
for even function, 450 
exponential form, 460 
fundamental period, 452 
Gibbs phenomenon, 495 
harmonics, 453, 454, 503 
least-squares property, 483 
for odd function, 450 
product of, 477 
rectangular wave, 470, 475 
relation to Bessel functions, 506 
relation to Fej^r polynomials, 497 
relation to Fourier integral, 518, 537 
relation to Laurent series, 459 
saw-tooth wave, 456, 464 
sine, 450 
summation, 479 
liiangluar wave, 471 
in two variables, 51 1 
value at a discontinuity, 464, 485, 500 
spectrum, 473 
synthesis, 519 
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Fourier transform, inverse, 473, 519 
of a product, 528 

Fractional function {see Linear fractional 
• transformation) 

Frequency, 410, 452, 523 
domain, 472, 519 
groups, 512 

spectrum, 473, 514, 517 
Functions, altitude, 183, 195, 351 
Bessel, 440, 506, 561 
characteristic. 111 

comi)lex variables (see Complex variables) 

conjugate potential, 333, 351 

cylinder, 440, 507, 562 

energ)', 156, 184 

envelope, 515 

error, 544 

even. 340, 341, 398, 412, 450 
expansions in series, 286, 290, 305, 307, 415 
exponential, time, 531 
fractional {see Linear fractional trans- 
formation) 

Hankel, 508 

mapping (see Conformal mapping) 
Neumann, 564 
odd, 340, 341, 398, 412, 450 
orthogonal, 442, 459, 501 
orthonormal, 442 

jjeriodic, 452, 477, (see also Fourier series) 

permanence of form of, 290 

positive real (see Positive real functions) 

potential (see Potential function) 

proper, 439, 440 

'pseudoscalar, 184 

pulse, 474, 494, 499, 521, 536 

real variable, integration of, 432 

scanning, 488, 498 

sequence of, 285 

singularity, 531, 544, 553 

spectrum representation, 473, 519 

transient, 517 

trigonometric, 439 

vector (see Vector function) 

Functions of a complex variable, algebraic, 
341, 318, 321 
analytic, 257 
branches, 318 
clas^hcation, 295, 317 
continuity, 258 
definition, 254, 256 
derivative, continuity, 257 
existence of, 257, 275 


Functions of a complex variable, derivative, 
order, 275 
uniqueness, 255 

differential condition equations, 256 
differentiation, 254, 275 
entire, 297, 321 

graphical representation, 253, 258, 267 
holomorphic, 257 
identity theorem, 290 
impedance, 359, 410, 416 
integral, 297, 548 
integration (see Integration) 
inverse, 260, 321, 326 
logarithm, 310, 313, 325 
mapping, conformal (see Conformal map- 
ping) 

isogonal, 259 

maximum and minimum values in a region, 
328, 334 

maximum modulus, principle of, 328, 334, 
417 

mermorphic, 297, 321 
multivalued, 261, 289, 296, 310, 1317, 321, 
326 

natural boundary, 288 

positive real (see Positive real functions) 

raUonal, 297, 307, 321, 359, 398, 410 

reciprocal, 305, 315, 360 

regular, 257 

relation between real and imaginary parts 
of, 256, 333, 339, 351 
schlict, 365 

singularities, algebraic, 318 
branch points, 315, 319 
definite' i, 257 

effect I n convergence of Taylor’s series, 
287, 288 
essential, 296 
infinitely dense, 288, 291 
integration around, 290, 304 
interpretation as vortexes, 265, 351 
isolated, 291 
logarithmic, 296, 345 
poles, 296, 323 
residue at, 304 

series expansions about, 290, 319 
types, 295 

transcendental, 297, 321 
uniqueness, 290 
uniqueness theorem., 290 
Fundamental component of a periodic func- 
tion, 453 
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Fundamental law of algebra, 325 
metric tensor, 92, 151 
period, 452 

properties of determinants, 1 
G 

Gaussian plane, 253 
Gauss’s law, 203, 217, 264 
Gibbs notation for vectors, 188, 190 
phenomenon, 495, 522, 535 
Gradient, 183, 195, 226 
curl of, 213 

in curvilinear co-ordinates, 240 
definition, 196 
divergence of, 206, 220 
notation, 197 
operator, 226, 230, 232 
of a scalar product, 230 
of the vector r, 233 
Gramian determinant, 146, 151 
Gravitational force, 200 

H 

Hamiltonian operator, 197, 203, 214 
Hankel functions, 508, 563, 568 
Harmonics, amplitudes, 453, 502 
analysis, 501 
even, 454 
odd, 454 

phase angles of, 453, 462, 526 
spherical, 440 

Heaviside Operational Calculus, 541 
Hilbert, D., 142, 562, 569 
transforms, 339 

application to network theory, 346 
degenerate forms, 343 
Holder, 285 

Holomorphic functions, 257 
Homographic transformation, 364 
Hurwitz criteria, 395 
pol 3 momial, definition, 395 
even and odd parts, 398, 412 
importance in positive real functions, 41 1 
properties, 397, 399, 401 
test for Hurwitz character, 401, 403 
determinantal form, 405 
zeros, 395 

Hydrodynamic analogy, 184, 201, 204, 208, 
216, 263, 300, 388 
Hyperbolic transformation, 367 


I 

Identity matrix, 38 
theorem, 290 
Image, 331, 361 

Impedance function, 359, 410, 416, 481 
Impulse, unit, 539 

Independence, linear, algebraic ccjuations, 
18, 100 

vector fields, 186 
vector set, 77, 100, 114, 143 
Inertia, law of, 148 
Infinite series (5r« Series, infinite) 

Infinity, point at, 262 
integration around, 307, 549 
vertex of polygon at, 384 
Initial conditions, 441 
Inner product, 188 
Integrability, 463 

Integral, Cauchy principal value of, 339 
formula, Cauchy% 272 
Fourier, 517, 548 

derivation from Fourier series, 518 
functions, 297 
1-aplace’s, 548 
law, Cauchy’s, 267 
line, 198, 267,311 
Poisson's, 330, 379 
sine-, 491, 501, 521, 534 
Sommerfcld, 507, 562 

Integration, contour, 199, 204, 223, 264, 268, 
548 

functions of a real variable, evaluation, 
432 

vector function, 199, 264 
functions of a complex variable, across a 
branch cut, 310, 560 
around a branch point, 310, 316, 560 
around the iwint at infinity, 307, 549 
around singularities, 290, 304, 382, 549, 
552, 560 

Cauchy’s integral formula, 272 
Cauchy’s integral law, 267, 548 
Cauchy’s residue theorem, 302 
independence of path, 268 
over a regular region, 267, 290 
power series, 284, i303 
saddle-point method, 565 
semicircular detour, 382, 549, 552, 560 
line, 198, 267, 311 
surface, 201, 204, 209, 215 
volume, 204, 230, 264 
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Inverse Fourier transform, 473, 519 
function of a complex variable, 260, 321, 
326 

linear transformation, 15, 39, 81 
matrix, 39, 44, 47, 57 
Inversion, graphical, 331, 361 
matrix, 41, 50, 63 
Irrotational field, 185 
Isogonal mapping, 259 
Iterated quadratic form, 155 

J 

7 -axis, 253 
Jacobian, 26, 237 

I. 

Lagrange’s identity, 245 
Lagrangian multiplier, 143, 211 
Lamellar field, 185 

Laplace’s development of determinants, 7 
e<{uation, 207, 243, 378, 439 
in Cartesian co-ordinates, 207 
in curvilinear co-ordinates, 243 
solution, 207 

in two dimensions, 266, 378 
integral, 548 
transform, 548 
Laplacian, 207, 220 
in curvilinear co-ordinates, 243 
operator, 206, 233 

Latent roots, associated with a positive defi- 
nite quadratic form, 149, 171 
associated with a quadratic form, 139, 149, 
157, 162, 171 
definition, 62 
distinctness of, 112, 141 
invariance with collineatory transforma- 
tion, 118 

of one matrix with respect to another 
matrix, 158, 162 
power of a matrix, 155 
relation to quadric surfaces, 139 
separation property of, 174 
symmetrical matrix, 121 
Laurent series (expansion), 290, 305, 319, 
324, 410, 459 
ascending part, 294 
convergence, 293 
descending part, 294 
integration of, 303 


Laurent series (expansion), near an algebraic 
singularity, 319 
principal part, 294, 308 
relation to Fourier series, 459 
relation to Taylor’s series, 290, 294 
Law of inertia, 148 

Least-squares approximation, 483, 501 
Left-handed co-ordinate system, 84, 184 
Legendre’s equation, 440 
Limit point, 276 
Line integral, 198, 267, 311 
spectrum, 473, 514 
transmission, 441, 452 

Linear algebraic equations {see also Linear 
transformation) 
equivalent set, 64 
homogeneous, 17 
independence, 18, 100 
inverse set, 14, 39, 81 
solution, Cramer’s rule, 15 
existence of, 16, 40, 100, 104, 115 
use of determinants, 13 
use of matrices, 39, 49, 63, {see also 
Matrix, inversion) 

Linear dependence and independence, of 
algebraic equations, 18, 100 
of vector fields, 186 
of a vector set, 77, 100, 114, 143 
Linear form, 133 

Linear fractional transformation, 363, 397, 
417 

fixed points of, 365 

interpretation on complex sphere, 376 
limitations of, 337, 365, 376 
properties of, 367, 373, 375 
of sch'act functions, 365 
Linear transformation, 30, 76 {see also Linear 
algebraic equations) 
affine, 85 

congruent, 136, 149, 157 
efifect on quadratic form, 135 
inverse, 15, 39, 81 
matrix of, 30, 54, 76, 136, 146, 157 
of a matrix, 54, 76, 136, 146, 149, 157, 
227 

orthogonal, 62, 81, 137, 156 
quadratic form of, 132, 151, 161 
vector significance of, 79, 186 
of vectors, 186, 227 

Logarithm, of a function of a complex vari- 
able, 310, 325 
principal value of, 313 
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Logarithmic branch point, 319 
singularity, 296, 345 

M 

M.I.T. Staff, 185, 207 
MacLane, S., 142 
Maclaurin’s series, 287, 330, 352 
Magnetic flux density, 349 
Map, contour, 196 
field, 387, 391 
flow, 185, 388 

Mapping, conformal (see Conformal mapping) 
function, 380 
isogonal, 259 
surfaces, 195 
Matrix, addition, 35 
adjoint. 42, 47 
augmented, 64 
canonical form, 61 
characteristic. 111, 141 
characteristic equation, 62, 111 
characteristic function, 111 
characteristic values, 62, 111 
column, 32 

conformability, 36, 49 
congruent transformation of, 136 
conjugate, 43 
definition, 30 

degenerate, 61, 105, 109, 137 
degree of degeneracy, 61 
determinant of, 31, 40, 137 
sign, 45, 84 
diagonal, 38, 42 
division by, 40 
Eigenwerte of, 140 
equality of, 32 
equivalence, 58 
identity, 38 
inverse, 39, 44, 47, 57 
inversion, 41, 50, 63 
latent roots (see Latent roots) 
of a linear transformation, 30, 54, 76, 136, 
140, 157 

modal, 116, 149, 155, 162 
multiplication, 31, 35 
nonsingular, 39 
null, 107 
nullity, 62, 109 
order, 31 

orthogonal, 45, 48, 57, 62, 84, 137, 148, 
166, 237, 442 


Matrix, partitioned, 48 
powers of, 120, 155 
Cayley-Hamilton theorem, 118 
product, 31, 35 
null, 107 

proper values, 140 
of a quadratic form, 132 
rank, 39, 43, 61, 105, 114, 159 
reciprocal, 45, 48 

reduction to diagonal form, 59, 111, 137, 
146, 152 

relation to quadratic form, 137, 143 
row, 31 
scalar, 38 
singular, 39, 47 
skew symmetrical, 39 
submatriccs, 48 
symmetricar, 39, 115 
of tensors, 187 

transformation, 54, 76, 136, 146, 149, 157, 
227 

transpose, 43, 46 
triangular, 67, 152, 154 
unit, 38, 54 

use in solving linear equations, 39, 49, 63 
Maximum modulus, principle of, 328, 334, 
417 

Mean square value, 478, 483, 530 
Membrane, 441 

Mermorphic functions, 297, 321 
Minor, 4, 19, 59 (see also Cofactor) 
complement of, 5 
principal, 5, 154, 406 
Modal matrix, 116, 149, 155, 157, 162 
Modulation, 526 
Moment, dipole, 353 
vector, 126, 354 

Multiplicity of poles and zeros, 296, 298 
Multiply connected region, 221, 223, 271 
Multivalued functions, 221, 261, 289, 296, 
310, 317, 321, 326 
I>otential, 221 

Schwarz-Christoffel transformation, 382 
N 

if>dimensional space, 76, •90, 134, 167 
Natural boundary, 288, 422 
Network, lossless, 416, 555 
theory, 158, 346, 359, 410, 416, 477, 481 
Neumann functions, 564 
Nonturbulent field, 185, 187, 213, 265 
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Normal co-ordinates, 111 
derivative, 196 

form, of a quadratic form {see Canonic 
fornr) 

Normalization of functions, 442 
of vectors, 89 
Null matrix, 107 
Nullity, 62 

Sylvester’s law of, 10*^ 

O 

Oblique co-ordinates, 85, 97. 159 
Odd functions, 340, 341, 398, 412, 450 
harmonics, 454 

Oi>erator, gradient, 226, 230, 232 
Hamiltonian, 197, 203, 214 
Laplacian, 206, 233, 439 
Order, determinant, 1 
matrix, 31 
tensor, 187 

Orthogonal circles, in linear fractional trans- 
formation, 366 

co-ordinates, 81,85,97, 137, 140, |l65, 184, 
234 {see also Co-ordinates, Cartesian) 
expansions, 442 
families of curves, 267 
functions, 442 

linear transformation, 62 81, 137, 156 
matrix, 45, 48, 57, 62, 84, 137, 148, 166, 
237, 442 

polynomials, 442 

vectors, 81, 84, 107, 121, 168, 189, 190, 266 
vector sets, 81, 121, 168 
Orthogonality, conditions, 442 
exponential functions, 444 
trigonometric functions, 444 
Orthonormal functions, 442 

P 

p.r. function {see Positive real functions) 
Parallelepiped, curvilinear, 241 
volume, in terms of vector product, 194 
Parallelogram, area of, in terms of vectors, 
190 

law of addition, 188 
Partial fraction expansion, 307, 415 
Partitioned matrix, 48 
Periodic functions, 452 (see also Fourier series) 
alternating component, 471 
beating, 513 


Periodic functions, effective value, 478 
harmonics, 453 
mean square value, 478 
product, 477 

Permanence of form of functions, 290 
Phase angle, harmonics of periodic function, 
453, 462, 526 
spectrum, 473 

Plane, complex, 253, 262, 268 
constraint, 165 
z-plane, 258 

Point, branch, 296, 314, 319, 351 
cluster, 276 
of condensation, 276 
lived, 365 

at infinity, 262, 307, 384 
limit, 276 
saddle, 298, 321 
set, 276 
singular, 296 
source, 204 
of stagnation, 298 
stationarv', 144, 172 
winding, 314 
Poisson’s equation, 207 
integrals, 330, 379 
Polar vector, 183 
Poles, 296, 323 
detection of, 326 
multiplicity, 296 

of a positive real function, 410, 414 
separation property, 400 
Polygon, mapping of, 383 
vertex angles, 385 

Polynorr al, conformal mapping, 397 
differtjitiability, 257 
Fejcr, 496 

Hurwitz, 395, 401, 411 
orthogonal, 442 

in rational fractions, 308, 359, 398, 410 
trigonometric, 436 
Tschebyscheff, 431 
zeros of, 308, 324 

Positive definite quadratic form, 144, 150, 
164, 171 

latent roots of, 149, 171 
Positive real functions, definition, 409 
polar form of, 419 
poles and zeros, 410, 414 
properties, 411, 112, 417, 419 
relation to Hurwitz polynomials, 41 1 
residues, 411, 414 
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Potential energy, 200, 222 
field, 185, 200, 218, 221, 231, 234, 263, 333, 
351 

function, conjugate, 333, 351 
multivalued, 221 

scalar, 183, 195, 199, 206, 221, 439 
vector, 218, 263, 266 
theory, 330, 349, 439 
dynamic, 439 
Power, average, 478, 530 
factor, 478 
product, 477, 428 
series {see Series, power) 

Principal axes, 137, 156, 159 
minor, 154, 406 

part of I^urent series, 294, 308 
value of an integral, 339 
value of logarithm, 313 
Projection, stereographic, 262, 372 
Proper functions, 439, 440 
values of a matrix, 140 
values of wave equation, 441 
Pseudoscalar functions, 184 
Pulse functions, 474, 494, 499, 521, 536 
Fourier transform of, 474, 521, 536 

Q 

Quadratic form, 132 
abridged form, 165 

associated latent roots, 139, 149, 157, 162, 171 
associated linear transformation, 132, 151, 
161 

bilinear form, 134 

canonic form of, 150, 154, 156, 159, 172 
degeneracy, 137 

discriminant of, 132, 137, 146, 154 
effect of constraints upon, 165, 172 
effect of co-ordinate transformation upon, 
135, 165 

effect of linear transformation upon, 135 
geometrical interpretation, 134, 159, 165 
{su also Quadric surface) 
iterated, 155 
law of inertia, 148 
matrix of, 132 

normai form {su Canonic form) 
positive and negative definite, 144, 149, 
150, 164, 171 
rank, 133, 137, 146, 148 
reduction to sum of squares, more than 
two forms, 164 


Quadratic form, reduction to sum of squares, 
single form, 137, 146 
two forms, 156, 159 
signature, 148 

stationary points of, 144, 172 
Quadratic surface {su Quadric surface) 
Quadric surface, central, 135 
ellipsoidal, 138, 144, 149, 156, 165 
principal axes of, 137, 156, 159 
relation of latent roots to, 139 
semiaxes, 140, 144 

vector interpretation, 134, 137, 145, 159 
Quadripole, 354 

R 

r- vector, 231 
curl, 234 
divergence, 233 
gradient, 233 

Radius of convergence, 283 
Rank, determinant, 18, 39, 61, 78 
matrix, 39, 43, 61, 105, 114, 159 
quadratic form, 132, 137, 146, 148 
vector set, 78, 101 
Ratio test, 278 

Rational fraction, 308, 359, 398, 410 
functions, 297, 307, 321 
Reactance, 416 
Reciprocal co-ordinates, 90 
function, 305, 315, 360 
matrix, 45, 48 

Rectangular co-ordinates {su Cartesian co- 
ordinates) 

Reflection, 361 

Region of analyticity, 257, 288 
integration around, 268, 290 
multiply connected, 221, 223, 271 
simply connected, 221, 222, 271 
Regular function, 257 
Residue, definition, 304 
evaluation, 293, 302, 305, 308 
Cauchy's theorem, 302 
in positive real functions, 411, 414 
Riemann, 256, 281, 507 
surfaces, 289, 310, 318 
visualization of, 315; 317, 321 
Right-handed co-ordinate system, 84, 184 
Ring, vortex, 216 
Root-mean-square value, 478 
Roots {su Zeros) 

Rotational field, 185 
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Rouch^'s theorem, 327 
Routh, 395, 407 
stability criteria, 395 
Row matrix, 31 
Rule of alternatives, 105 

S 

Saddle point, 298, 321 
conformal mapping near, 300 
method of integration, 565 
Saw-tooth wave, 456, 464 
Scalar field, 183 

jx)tential function, 183, 195, 199, 206, 221 
product of vectors, 80. 122, 184, 188, 228, 
266 

gradient of, 230 
triple product, 193, 198 
Scanning function, 488, 498 
Schlicht functions, 365 
Schwarz-Christoffel formula, 380 
transformation, 378 
critical |X)ints, 382 
examples, 387 
multivaluedness, 382 
Schwarz^s lemma, 327 
Screw rule, 183, 188, 209, 211, 214, 217 
Scmiaxes, 140, 144 

Semicircular detour, in contour integration, 
382, 549, 552, 560 

Separation property, of zeros and poles, 400 
Sequence, arithmetic mean, 285 
of functions, 285 
of points, 276 
Series, Cesaro sum, 285 
convergence, absolute, 277 
Cauchy’s principle of, 277, 466 
circle, 283, 288 
comparison test, 279 
conditional, 281 
d’Alembert ratio test, 278 
Dedckind test, 282 
at a discontinuity, 464, 485, 500 
Fourier, 459, 464 
Laurent, 293 
Maclaurin, 287 
radius of, 283 
Taylor, 287, 288 
uniform, 283 
dominant, 279 

expansions of functions in, 286, 290, 305, 
307, 403, 415, 441 


Fourier, 448 
geometric, 436 
infinite, 276 

Laurent, 290, 305, 319, 324, 410, 459 
Maclaurin, 287, 330, 352 
power, 280, 283 

differentiation and integration, 284, 303 
expansions of functions in, 286, 290, 305 
summation, 479 
Taylor, 287, 305, 322, 323 
in two variables, 511 
Signature, 148 

Simply connected region, 221, 222, 271 
Sire-integral, 491, 501, 521, 534 
Singular matrix, 39, 47 
point, 296 

Singularities, algebraic, 318 
branch points, 315, 319 
definition, 257 

effect on convergence of Taylor’s series, 
287, 288 
essential, 296 
exponential function, 296 
infinitely dense, 288, 291 
integration around, 290, 304, 382, 548, 552, 
560 

interpretation as vortexes, 265, 351 

isolated, 291 

logarithmic, 296, 345 

poles, 296, 323 

residue at, 304 

evaluation, 293, 302, 305, 308 
series expansions about, 290, 319 
types, 295 

Singularity functions, 531, 544, 553 
Sink, 1^0, 203, 388, 392 
Skew-symmetrical matrix, 39 
Solenoidal field, 185 
Sommerfeld, 507 
integral, 507, 562 
Source {see also Vortex) 
density, 215 

distribution, 200, 206, 291 
filamental, 204 
idealized, 204 
field, 185 

of flow lines, 185, 203, 388, 392 
point, 204 
voltage, 478 

Source-free field, 185, 207, 218, 265 
Spectrum, amplitude, 473 
continuous, 517 
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Spectrum. Fourier, 473 
frequency, 473, 514, 517 
line, 473, 514 
phase, 473 

Sphere, complex, 262, 317, 376 
Spherical harmonics, 440 
polar co-ordinates, 234, 230 
Square wave, 470 
Stability criteria, 395 
Stagnation, point of, 298 
Stationarj' point, 144, 172 
Step, unit, 533, 541, 545, 551 
Stereographic projection, 262, 317, 376 
Stieltjes continued fraction, 403 
Stokes’s law, 214, 223, 265, 270 
Stratum, double, 357 
Struik, D. J., 113 
Submatrices, 48 
Summation, Cesaro, 285, 496 
Fourier series, 479 
Surface, associated with quadratic form, 134 
constant-value, 195 
curl, 217 
divergence, 206 
integration, 201, 204, 209, 215 
mapping, 195 

quadric {see Quadric surface) 

Riemann, 289, 310, 318 
visualization of, 315, 317, 321 
of source distribution, 204 
Sylvester’s law of nullity, 109 
Symmetrical determinant, 15 
matrix, 39, 115 
latent roots of, 121 
transformation, 121, 133 

T 

Tayloris series, 287, 305, 322, 323 
region of convergence, 287, 288 
relation to Laurent series, 290, 294 
Tensor, 92, 151, 187, 227 
components, 187 
fundamental metric, 92, 151 
matrix of, 187 
notation, 99 
order, 187 
valence, 187 
Thread, vortex, 216 
Time differentiation, 225, 439 
domain, 472, 519 
series (su Fourier series) 

Titchmarsh, £. C,, 339, 340 


Torque, 183, 208 

Transcendental functions, 297, 321 
Transform, Fourier, 473, 519 
inverse, 473, 519 
Hilbert, 339 

application to circuit theory, 346 
degenerate forms, 343 
Laplace, 548 

Transformation, bilinear, 363 
collineatory, 117, 137 
congruent, 136, 149, 157 
co-ordinate {see Co-ordinates, transforma- 
tion of) 

elemeniar\% of matrices, 54, 146 
elliptic, 368 

fractional (5ff Linear fractional trans- 
fer mationj 
group, 373 
homographic, 364 
hyperbolic, 367 

linear {see Linear transformation) 
matrix {see Linear transformation) 
orthogonal. 62. 81, 137, 140, 156, 166, 172. 
184. 234 

Schwarz-ChristofTcl, 378 
symmetrical. 121, 133 
vector, 186, 227 
Transient functions, 517 
convolution of, 530 
effective value of, 530 
mean value of, 530 
Transmission line, 441, 452, 555 
Transpose, of a matrix, 43, 46 
of a vector set, 77, 163 
Triangular form of determinant, 7 
matrix, 67, 152, 154 
wave, 471 

Trigonometric functions, 439 
orthogonality of, 444 
polynomial, 436 
series {see Fourier series) 

Triple product of vectors, scalar, 193, 198 
vector, 192 

Triplet, unit, 543, 546 
Tschebyscheff polynomials, 431 
Tube, 185, 230 

Turbulent field, 185, 186, 2b8, 215 
U 

Uniform convergence, 283 
abscissa of, 575 
Fcj6r polynomials, 500 
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Uniform convergence, Fourier series, 465 
tolerance, 502 
Uniqueness theorem, 290 
Unit circle, 284, 361, 369, 417, 419 
doublet, 542, 546 
impulse, 539, 545 
matrix, 38, 54 
step, 533, 541, 545, 551 
triplet, 543, 546 
vectors, 85, 167, 217 
in curvilinear co-ordinates, 235 
in oblique co-ordinates, 87 
scalar product of, 189 
vector product of, 192 

V 

Valence of a tensor, 187 
Variables, complex {see (,'om[)lex variables) 
functions of {set Functions of a complex 
variable) 

contragredient, 95 
Vector analysis, 183 
axial, 183 
constraint, 166 

contra variant and cova riant components, 
95, 159 

curl, 208, 213, 217, 229, 234, 243 
definition, 183 
del, 197 

derivative, 225, 232 
differential, 198, 201, 235 
differentiation, 225, 232 
direction cosines, 82, 101, 112, 162 
direction parameters, 126 
divergence, 200, 206, 228, 233, 242, 264 
field, irrotational, 185 
lamellar, 185 

nonturbulent, 185, 187, 213, 265 
potential, 185, 200, 218, 234, 263 
rotational, 185 
solenoidal, 185 

source-free, 185, 207, 218, 265 
turbulent, 185, 186, 208, 215 
force, 198, 208 
function, 218, 228, 263 
linear, 228 

potential, 218, 263, 266 
Gauss’s law, 203, 217, 264 
Gibbs notation, 188, 190 
Hamiltonian operator, 197, 203, 214 
interpretation of a complex number, 253 


Vector interpretatoin of a linear transformar 
tion, 79, 186, 227 

of a quadric surface, 134, 137, 145, 159 
moment, 126, 354 
normalization of, 89 

orthogonal, 81, 84, 107, 121, 168, 189, 190, 
266 
polar, 183 

product of vectors, cross, 190 
determinantal form of, 192, 194 
dot, 188 
inner, 188 

scalar, 80, 122, 184, 188, 228, 266 
scalar triple, 193, 198 
triple vector, 192 
vector, 184, 190, 228 
r, 231 

set, associated with quadric surface, 134, 
137, 145, 159 
inconsistent, 104 

linearly dependent, 77, 100, 114, 143 
linearly independent, 77, 145 
orthogonal, 81, 121, 168 
rank, 78, 101 
transposed, 77, 163 
Stokes’s law, 214, 223, 265, 270 
time-varying, 225 
unit, 85, 167, 217 
in curvilinear co-ordinates, 235 
in oblique co-ordinates, 87 
length, 88 
scalar product, 189 
vector product, 192 
useful relations, 228, 231 
use of ectors in interpreting linear equa- 
. lions, 79, 100, 132 
Versors, 127 

Vertex angles of a polygon, 385 
Vortex, definition, 186 
density, 208, 215, 221 
distribution, 215, 219, 291 
field, 186 

representation by a ^gularity, 265, 
351 

ring, 216 
thread, 216 

W 

u;-plane, 258 
Wave equation, 440 

proper values of, 441 
motion, 439 
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Weber, 507 
Winding point, 314 

Work, evaluation of, by line integral, 198, 208 
Wronskian, 25 


Z 


s-plane, 258 

associated complex sphere, 262 


Zeros, 297, 298, 323, 395 
conformal mapping near, 300 
detection of, 325, 408 
of a Hurwitz polynomial, 395 
multiplicity of, 298 
of a polynomial, 308, 324, 395 
of a positive real function, 411, 414 
separation property of, 400 






